The Cover shows the Cayley graph for the smallest non-Abelian simple group, the alternating group
$A_5$ (see §11). We will see in §16 that the simplicity of this group means there is no algebraic expression
for any of the roots of the polynomial $x^5 - 4x + 2$ using the algebraic ingredients,

$$\tfrac{a}{b} \in \mathbb{Q},\ +,\ -,\ \times,\ \div,\ \sqrt[2]{\ },\ \sqrt[3]{\ },\ \sqrt[4]{\ },\ \sqrt[5]{\ },\ \ldots$$

so there can be no formula for the solutions of $ax^5 + bx^4 + cx^3 + dx^2 + ex + f = 0$ that works
for all possible $a, b, c, d, e, f \in \mathbb{C}$.




The Cayley graph is a visual depiction of the multiplication in the group $A_5$. The vertices correspond
to the elements of the group as marked, the red edges to the particular element $\sigma = (1, 2, 3, 4, 5)$ and the
black edges to $\tau = (1, 2)(3, 4)$. The red pentagonal faces are oriented anticlockwise with respect to the
outward pointing normal vector (use the right-hand rule), so that crossing a red edge in an anticlockwise
direction corresponds to $\sigma$ and crossing in a clockwise direction corresponds to $\sigma^{-1}$ (as in the diagram).

The edges depict multiplication on the right: crossing a red edge anticlockwise (respectively clockwise)
multiplies the label of the start vertex by $\sigma$ (resp. $\sigma^{-1}$) to give the label of the finish vertex; crossing a
black edge in either direction multiplies the label of the start vertex by $\tau$ to give the label of the finish
vertex.¹

Thus, the green sequence of edges gives the decomposition $(1, 4)(3, 5) = \tau\sigma^{-2}\tau\sigma^{-1}$ and the blue
sequence shows that $(1, 2, 4, 5, 3)\tau\sigma^{-2}\tau = (1, 3, 5)$.

It is a curious coincidence that the Cayley graph of the simplest non-Abelian simple group is the 
Buckminsterfullerene molecule: the simplest known pure form of Carbon. 

¹ The reason for the lack of orientation on the black edges is that the permutation $\tau = \tau^{-1}$.

§1. What is Galois Theory?



A quadratic equation $ax^2 + bx + c = 0$ has exactly two (possibly repeated) solutions in the complex
numbers. We can even write an algebraic expression for them, thanks to a formula that first appears in the
ninth century book Hisab al-jabr w'al-muqabala by Abu Abd-Allah ibn Musa al'Khwarizmi, and written
in modern notation as,

$$x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}.$$

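Less familiar maybe, $ax^3 + bx^2 + cx + d = 0$ has three $\mathbb{C}$-solutions, and they too can be expressed
algebraically using Cardano's formula. For instance, one solution turns out to be,

$$-\frac{b}{3a} + \sqrt[3]{\left(\frac{-b^3}{27a^3} + \frac{bc}{6a^2} - \frac{d}{2a}\right) + \sqrt{\left(\frac{-b^3}{27a^3} + \frac{bc}{6a^2} - \frac{d}{2a}\right)^2 + \left(\frac{c}{3a} - \frac{b^2}{9a^2}\right)^3}}$$
$$\qquad + \sqrt[3]{\left(\frac{-b^3}{27a^3} + \frac{bc}{6a^2} - \frac{d}{2a}\right) - \sqrt{\left(\frac{-b^3}{27a^3} + \frac{bc}{6a^2} - \frac{d}{2a}\right)^2 + \left(\frac{c}{3a} - \frac{b^2}{9a^2}\right)^3}}$$

and the other two have similarly horrendous expressions. There is an even more complicated formula,
attributed to Descartes, for the roots of a quartic polynomial equation.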

What is mildly miraculous is not that the solutions exist, but that they can always be expressed algebraically
in terms of the coefficients and the basic algebraic operations,

$$+,\ -,\ \times,\ \div,\ \sqrt[2]{\ },\ \sqrt[3]{\ },\ \sqrt[4]{\ },\ \sqrt[5]{\ },\ \ldots$$



By the turn of the 19th century, no equivalent formula for the solutions to a quintic (degree five)
polynomial equation had materialised, and it was Abel who had the crucial realisation: no such formula
exists!

Such a statement can be interpreted in a number of ways. Does it mean that there are always algebraic 
expressions for the roots of quintic polynomials, but their form is too complex for one single formula to 
describe all the possibilities? It would therefore be necessary to have a number, maybe even infinitely 
many, formulas. The reality turns out to be far worse: there are specific polynomials, such as $x^5 - 4x + 2$,
whose solutions cannot be expressed algebraically in any way whatsoever. There is no formula for the 
roots of just this single polynomial, never mind all the others. 

A few decades after Abel's bombshell, Evariste Galois started thinking about the deeper problem: why 
don't these formulae exist? Thus Galois theory was originally motivated by the desire to understand, in a 
much more precise way than they hitherto had been, the solutions to polynomial equations. 

Galois' idea was this: study the solutions by studying their "symmetries". Nowadays, when we hear
the word symmetry, we normally think of group theory rather than number theory. Actually, to reach 
his conclusions, Galois kind of invented group theory along the way. In studying the symmetries of the 
solutions to a polynomial, Galois theory establishes a link between these two areas of mathematics. We 
illustrate the idea, in a somewhat loose manner, with an example. 



The symmetries of the solutions to $x^3 - 2 = 0$.

(1.1) We work in $\mathbb{C}$. Let $\alpha$ be the real cube root of 2, ie: $\alpha = \sqrt[3]{2} \in \mathbb{R}$ and,
$\omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}i$. Note that $\omega$ is a cube root of 1, and so $\omega^3 = 1$.

[Figure: the equilateral triangle with vertices $\alpha$, $\alpha\omega$ and $\alpha\omega^2$, with the lines of the reflections $r_1$ and $r_2$.]

The three solutions to $x^3 - 2 = 0$ (or roots of $x^3 - 2$) are the complex
numbers $\alpha$, $\alpha\omega$ and $\alpha\omega^2$, forming the vertices of the equilateral triangle shown.
The triangle has what we might call "geometric symmetries": three reflections,
a counter-clockwise rotation through $\frac{1}{3}$ of a turn, a counter-clockwise rotation
through $\frac{2}{3}$ of a turn and a counter-clockwise rotation through $\frac{3}{3}$ of a turn = the
identity symmetry. Notice for now that if $r_1$ and $r_2$ are the reflections in the
lines shown, the geometrical symmetries are $r_1$, $r_2$, $r_2 r_1 r_2$, $r_2 r_1$, $(r_2 r_1)^2$ and $(r_2 r_1)^3 = 1$ (read these
expressions from right to left).

The symmetries referred to in the preamble are not so much geometric as "number theoretic". It will
take a little explaining before we see what this means.

(1.2) A field is a set $F$ with two operations, called, purely for convenience, $+$ and $\times$, such that for any
$a, b, c \in F$,

1. $a + b$ and $a \times b$ ($= ab$ from now on) are uniquely defined elements of $F$,

2. $a + (b + c) = (a + b) + c$,

3. $a + b = b + a$,

4. there is an element $0 \in F$ such that $0 + a = a$,

5. for any $a \in F$ there is an element $-a \in F$ with $(-a) + a = 0$,

6. $a(bc) = (ab)c$,

7. $ab = ba$,

8. there is an element $1 \in F \setminus \{0\}$ with $1 \times a = a$,

9. for any $a \neq 0 \in F$ there is an $a^{-1} \in F$ with $aa^{-1} = 1$,

10. $a(b + c) = ab + ac$.

A field is just a set of things that you can add, subtract, multiply and divide so that the "usual" rules of
algebra are satisfied. Familiar examples of fields are $\mathbb{Q}$, $\mathbb{R}$ and $\mathbb{C}$; familiar examples of non-fields are $\mathbb{Z}$,
polynomials and matrices (you can't in general divide integers, polynomials and matrices to get integers,
polynomials or matrices).

(1.3) A subfield of a field $F$ is a subset that also forms a field under the same $+$ and $\times$. Thus, $\mathbb{Q}$ is a
subfield of $\mathbb{R}$ which is in turn a subfield of $\mathbb{C}$, and so on. On the other hand, $\mathbb{Q} \cup \{\sqrt{2}\}$ is not a subfield
of $\mathbb{R}$: it is certainly a subset but axiom 1 fails, as both 1 and $\sqrt{2}$ are elements but $1 + \sqrt{2}$ is not.

Definition. If $F$ is a subfield of the complex numbers $\mathbb{C}$ and $\beta \in \mathbb{C}$, then $F(\beta)$ is the "smallest" subfield
of $\mathbb{C}$ that contains both $F$ and the number $\beta$.

What do we mean by smallest? That there is no other field $F'$ having the same properties as $F(\beta)$
which is smaller, ie: no $F'$ with $F \subset F'$ and $\beta \in F'$ too, but $F'$ properly $\subset F(\beta)$. It is usually more
useful to say it the other way around:

If $F'$ is a subfield that also contains $F$ and $\beta$, then $F'$ contains $F(\beta)$ too. (*)

Loosely speaking, $F(\beta)$ is all the complex numbers we get by adding, subtracting, multiplying and
dividing the elements of $F$ and $\beta$ together in all possible ways.

(1.4) To illustrate with some trivial examples, $\mathbb{R}(i)$ can be shown to be all of $\mathbb{C}$: it must contain all
expressions of the form $bi$ for $b \in \mathbb{R}$, and hence all expressions of the form $a + bi$ with $a, b \in \mathbb{R}$, and this
accounts for all the complex numbers; $\mathbb{Q}(2)$ is equally clearly just $\mathbb{Q}$ back again.

Slightly less trivially, $\mathbb{Q}(\sqrt{2})$, the smallest subfield of $\mathbb{C}$ containing all the rational numbers and $\sqrt{2}$, is
a field that is strictly bigger than $\mathbb{Q}$ (eg: it contains $\sqrt{2}$) but is much, much smaller than all of $\mathbb{R}$.
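To get a feel for how small $\mathbb{Q}(\sqrt{2})$ is, here is a minimal Python sketch that models its elements as $a + b\sqrt{2}$ with $a, b$ rational. That every element has this form is a standard fact which we assume here rather than prove, and the class name `QSqrt2` and its methods are purely illustrative.

```python
from fractions import Fraction

class QSqrt2:
    """An element a + b*sqrt(2) with a, b rational (hypothetical helper class)."""

    def __init__(self, a, b=0):
        self.a, self.b = Fraction(a), Fraction(b)

    def __add__(self, other):
        return QSqrt2(self.a + other.a, self.b + other.b)

    def __mul__(self, other):
        # (a + b√2)(c + d√2) = (ac + 2bd) + (ad + bc)√2
        return QSqrt2(self.a * other.a + 2 * self.b * other.b,
                      self.a * other.b + self.b * other.a)

    def inverse(self):
        # 1/(a + b√2) = (a - b√2)/(a^2 - 2b^2); the denominator is non-zero
        # for (a, b) != (0, 0) because sqrt(2) is irrational.
        n = self.a ** 2 - 2 * self.b ** 2
        return QSqrt2(self.a / n, -self.b / n)

    def __eq__(self, other):
        return (self.a, self.b) == (other.a, other.b)

    def __repr__(self):
        return f"{self.a} + {self.b}*sqrt(2)"

x = QSqrt2(1, 1)            # 1 + sqrt(2)
print(x * x)                # 3 + 2*sqrt(2)
print(x * x.inverse())      # 1 + 0*sqrt(2)
```

Sums, products and inverses never leave the set of such pairs, which is the sense in which this field has "just enough" numbers to contain $\mathbb{Q}$ and $\sqrt{2}$.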

Exercise 1 Show that $\sqrt{3} \notin \mathbb{Q}(\sqrt{2})$.

(1.5) Returning to the symmetries of the solutions to $x^3 - 2 = 0$, we look at the field $\mathbb{Q}(\alpha, \omega)$, where
$\alpha = \sqrt[3]{2} \in \mathbb{R}$ and $\omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}i$, as before. Since $\mathbb{Q}(\alpha, \omega)$ is by definition a field, and fields are closed
under $+$ and $\times$, we have

$$\alpha \in \mathbb{Q}(\alpha, \omega) \text{ and } \omega \in \mathbb{Q}(\alpha, \omega) \Rightarrow \alpha \times \omega = \alpha\omega,\ \alpha \times \omega \times \omega = \alpha\omega^2 \in \mathbb{Q}(\alpha, \omega) \text{ too.}$$

So, $\mathbb{Q}(\alpha, \omega)$ contains all the solutions to the equation $x^3 - 2 = 0$. On the other hand,

Exercise 2 Show that $\mathbb{Q}(\alpha, \omega)$ has "just enough" numbers in it to solve the equation $x^3 - 2 = 0$. More precisely, $\mathbb{Q}(\alpha, \omega)$ is the
smallest (in the sense (*)) subfield of $\mathbb{C}$ that contains all the solutions to this equation (hint: you may find it useful to do Exercise 5
first).

(1.6) A very loose definition of a symmetry of the solutions of $x^3 - 2 = 0$ is that it is a "rearrangement"
of $\mathbb{Q}(\alpha, \omega)$ that does not disturb (or is compatible with) the $+$ and $\times$.

To see an example, consider the two fields $\mathbb{Q}(\alpha, \omega)$ and $\mathbb{Q}(\alpha, \omega^2)$. Despite first appearances they are
actually the same: certainly

$$\alpha, \omega \in \mathbb{Q}(\alpha, \omega) \Rightarrow \alpha, \omega^2 \in \mathbb{Q}(\alpha, \omega).$$

But $\mathbb{Q}(\alpha, \omega^2)$ is the smallest field containing $\mathbb{Q}$, $\alpha$ and $\omega^2$, so by (*),

$$\mathbb{Q}(\alpha, \omega^2) \subseteq \mathbb{Q}(\alpha, \omega).$$

Conversely,

$$\alpha, \omega^2 \in \mathbb{Q}(\alpha, \omega^2) \Rightarrow \omega^2 \times \omega^2 = \omega^4 = \omega \in \mathbb{Q}(\alpha, \omega^2) \Rightarrow \mathbb{Q}(\alpha, \omega) \subseteq \mathbb{Q}(\alpha, \omega^2).$$

Remember that $\omega^3 = 1$ so $\omega^4 = \omega$. Thus $\mathbb{Q}(\alpha, \omega)$ and $\mathbb{Q}(\alpha, \omega^2)$ are indeed the same. In fact, we should
think of $\mathbb{Q}(\alpha, \omega)$ and $\mathbb{Q}(\alpha, \omega^2)$ as two different ways of looking at the same field, or more suggestively,
the same field viewed from two different angles.

Whenever we hear the phrase, "the same field viewed from two different angles", we should immediately
think that a symmetry is lurking: a symmetry that moves the field from the one point of view to
the other. In the case above, there should be a symmetry of the field $\mathbb{Q}(\alpha, \omega)$ that puts it into the form
$\mathbb{Q}(\alpha, \omega^2)$. Surely this symmetry should send

$$\alpha \mapsto \alpha, \text{ and } \omega \mapsto \omega^2.$$

We haven't yet defined what we mean by, "is compatible with the $+$ and $\times$". It will turn out to mean
that if $\alpha$ and $\omega$ are sent to $\alpha$ and $\omega^2$ respectively, then $\alpha \times \omega$ should go to $\alpha \times \omega^2$; similarly $\alpha \times \omega \times \omega$
should go to $\alpha \times \omega^2 \times \omega^2 = \alpha\omega^4 = \alpha\omega$. The symmetry thus moves the vertices of the equilateral triangle
determined by the roots in the same way that the reflection $r_1$ of the triangle does²:

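[Figure: the triangle with vertices $\alpha$, $\alpha\omega$ and $\alpha\omega^2$; the symmetry fixes $\alpha$ and swaps $\alpha\omega$ with $\alpha\omega^2$.]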
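The effect of this symmetry on the three roots can be checked numerically. The sketch below is only a floating-point sanity check, not a proof; the helper `index_of` and the tolerance are assumptions made for the illustration.

```python
import cmath

alpha = 2 ** (1 / 3)                      # real cube root of 2
omega = cmath.exp(2j * cmath.pi / 3)      # -1/2 + (sqrt(3)/2) i
roots = [alpha, alpha * omega, alpha * omega ** 2]

def index_of(z, candidates, tol=1e-9):
    """Position of z in candidates, up to floating-point tolerance."""
    return next(i for i, c in enumerate(candidates) if abs(z - c) < tol)

# Images of alpha, alpha*omega, alpha*omega^2 under alpha -> alpha, omega -> omega^2
new_omega = omega ** 2
images = [alpha, alpha * new_omega, alpha * new_omega ** 2]

print([index_of(z, roots) for z in images])   # [0, 2, 1]
```

The output says that the first root is fixed and the other two are swapped, which is exactly what a reflection of the triangle through the vertex $\alpha$ does.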

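² This compatibility also means that it would have made no sense to have the symmetry send $\alpha \mapsto \omega^2$ and $\omega \mapsto \alpha$. A symmetry
should not fundamentally change the algebra of the field, so that if an element like $\omega$ cubes to give 1, then its image under the
symmetry should too: but $\alpha$ doesn't cube to give 1.

(1.7) In exactly the same way, we can consider the fields $\mathbb{Q}(\alpha\omega, \omega^2)$ and $\mathbb{Q}(\alpha, \omega)$. We have

$$\alpha, \omega \in \mathbb{Q}(\alpha, \omega) \Rightarrow \omega^2, \alpha\omega \in \mathbb{Q}(\alpha, \omega) \Rightarrow \mathbb{Q}(\alpha\omega, \omega^2) \subseteq \mathbb{Q}(\alpha, \omega);$$

and conversely, $\alpha\omega, \omega^2 \in \mathbb{Q}(\alpha\omega, \omega^2) \Rightarrow \alpha\omega\omega^2 = \alpha\omega^3 = \alpha \in \mathbb{Q}(\alpha\omega, \omega^2)$, and hence also

$$\alpha^{-1} \times \alpha\omega = \omega \in \mathbb{Q}(\alpha\omega, \omega^2) \Rightarrow \mathbb{Q}(\alpha, \omega) \subseteq \mathbb{Q}(\alpha\omega, \omega^2).$$

So the two fields are the same, and there should be a symmetry sending $\alpha \mapsto \alpha\omega$ and $\omega \mapsto \omega^2$.
To be compatible with the $+$ and $\times$,

$$\alpha \times \omega \mapsto \alpha\omega \times \omega^2 = \alpha\omega^3 = \alpha, \text{ and } \alpha \times \omega \times \omega \mapsto \alpha\omega \times \omega^2 \times \omega^2 = \alpha\omega^5 = \alpha\omega^2.$$

So the symmetry is like the reflection $r_2$ of the triangle: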

Finally, if we have two symmetries of the solutions to some equation, we would like their composition
to be a symmetry too. So if the symmetries $r_1$ and $r_2$ of the original triangle are to be considered, so
should $r_2 r_1 r_2$, $r_1 r_2$, $(r_1 r_2)^2$ and $(r_1 r_2)^3 = 1$.

(1.8) The symmetries of the solutions to $x^3 - 2 = 0$ include all the geometrical symmetries of the
equilateral triangle. We will see much later that any symmetry of the solutions is uniquely determined
as a permutation of the solutions. Since there are $3! = 6$ of these, we have accounted for all of them. It
would appear then that the solutions to $x^3 - 2 = 0$ have symmetry precisely the geometrical symmetries
of the equilateral triangle.

(1.9) If this was always the case, things would be very simple: Galois theory would just be the study 
of the "shapes" formed by the roots of polynomials, and the symmetries of those shapes. It would be a 
branch of planar geometry. 

But things are not so simple. If we look at the solutions to $x^5 - 2 = 0$, something quite different
happens:

[Figure: the roots of $x^5 - 2$, forming the vertices of a regular pentagon.]

We will see later on how to obtain these expressions for the roots. A pentagon has 10 geometric symmetries,
and you can check that all arise as symmetries of the roots of $x^5 - 2$ using the same reasoning as in
the previous example. But this reasoning also gives a symmetry that moves the vertices of the pentagon
according to:

[Figure: a rearrangement of the vertices of the pentagon.]

This is not a geometrical symmetry! Later we will see that for $p > 2$ a prime number, the solutions to
$x^p - 2 = 0$ have $p(p - 1)$ symmetries. While agreeing with the six obtained for $x^3 - 2 = 0$, it gives
twenty for $x^5 - 2 = 0$. In fact, it was a bit of a fluke that all the number theoretic symmetries were also
geometric ones for $x^3 - 2 = 0$. A $p$-gon has $2p$ geometrical symmetries and $2p \leq p(p - 1)$ with equality
only when $p = 3$.
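The comparison between the two counts is a one-line computation, written out here for completeness under the standing assumption that $p > 2$ is prime:

```latex
p(p-1) - 2p \;=\; p(p-3) \;\geq\; 0 \quad\text{for } p \geq 3,
\qquad\text{with equality exactly when } p = 3.
```

For $p = 5$ this is the gap between the 10 geometric symmetries of the pentagon and the 20 symmetries of the roots.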




[Figure labels: $\alpha = \sqrt[5]{2}$ and $\omega = \frac{\sqrt{5} - 1}{4} + \frac{\sqrt{2}\sqrt{5 + \sqrt{5}}}{4}i$.]

Exercise 3 Show that the figure on the left depicts a symmetry of the solutions to $x^3 - 1 = 0$, but the one on the right does not.





Further Exercises for §1.

Exercise 4 You already know that the 3-rd roots of 1 are 1 and $-\frac{1}{2} \pm \frac{\sqrt{3}}{2}i$. What about the $p$-th roots for higher primes?

1. If $\omega \neq 1$ is a 5-th root it satisfies $\omega^4 + \omega^3 + \omega^2 + \omega + 1 = 0$. Let $u = \omega + \omega^{-1}$. Find a quadratic polynomial satisfied
by $u$, and solve it to obtain $u$.

2. Find another quadratic satisfied this time by $\omega$, with coefficients involving $u$, and solve it to find explicit expressions for the
four primitive 5-th roots of 1.

3. Repeat the process with the 7-th roots of 1.

factoid: the $n$-th roots of 1 can be expressed in terms of field operations and extraction of pure roots of rationals for any $n$. The
details (which are a little complicated!) were finally completed by the work of Gauss and Galois.



Exercise 5

1. Let $F$ be a field such that the element

$$\underbrace{1 + 1 + \cdots + 1}_{n \text{ times}} \neq 0,$$

for any $n > 0$. Arguing intuitively, show that $F$ contains a copy of the rational numbers $\mathbb{Q}$ (see also §4).

2. Give an example of a field where

$$\underbrace{1 + 1 + \cdots + 1}_{n \text{ times}} = 0,$$

for some $n$.



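Exercise 6 Let $\alpha = \sqrt[6]{5} \in \mathbb{R}$ and $\omega = \frac{1}{2} + \frac{\sqrt{3}}{2}i$. Show that $\mathbb{Q}(\alpha, \omega)$, $\mathbb{Q}(\alpha\omega^2, \omega^5)$ and $\mathbb{Q}(\alpha\omega^4, \omega^5)$ are all the same field.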
Exercise 7

1. Show that there is a symmetry of the solutions to $x^5 - 2 = 0$ that moves the vertices of the pentagon according to:

[Figure: a rearrangement of the vertices of the pentagon.]

where $\alpha = \sqrt[5]{2}$, and $\omega^5 = 1$, $\omega \in \mathbb{C}$.

2. Show that the solutions in $\mathbb{C}$ to the equation $x^6 - 5 = 0$ have 12 symmetries.

§2. Polynomials, Rings and Polynomial Rings



(2.1) There are a number of basic facts about polynomials that we will need. Suppose $F$ is a field ($\mathbb{Q}$, $\mathbb{R}$
or $\mathbb{C}$ will do for now). A polynomial over $F$ is an expression of the form

$$f = a_0 + a_1 x + \cdots + a_n x^n,$$

where the $a_i \in F$ and $x$ is a "formal symbol" (sometimes called an indeterminate). We don't tend to
think of $x$ as a variable: it is purely an object on which to perform algebraic manipulations. Denote the
set of all polynomials over $F$ by $F[x]$. If $a_n \neq 0$, then $n$ is called the degree³ of $f$, written $\deg(f)$. If the
leading coefficient $a_n = 1$, then $f$ is monic.

(2.2) We can add and multiply elements of $F[x]$ in the usual way:

if $f = \sum_{i=0}^{n} a_i x^i$ and $g = \sum_{i=0}^{m} b_i x^i$, then,

$$f + g = \sum_{i=0}^{\max(m,n)} (a_i + b_i)x^i \quad\text{and}\quad fg = \sum_{k=0}^{m+n} c_k x^k \ \text{ where }\ c_k = \sum_{i+j=k} a_i b_j, \qquad (1)$$

that is, $c_k = a_0 b_k + a_1 b_{k-1} + \cdots + a_k b_0$. The arithmetic of the coefficients (ie: how to work out
$a_i + b_i$, $a_i b_j$ and so on) is just that of the field $F$.
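The two formulas at (1) translate directly into code. The following is a minimal sketch over $\mathbb{Q}$, storing a polynomial as its list of coefficients $[a_0, a_1, \ldots, a_n]$; the function names and the list representation are assumptions made for the illustration.

```python
from fractions import Fraction
from itertools import zip_longest

def poly_add(f, g):
    """Add coefficient lists [a_0, a_1, ...] term by term."""
    return [a + b for a, b in zip_longest(f, g, fillvalue=Fraction(0))]

def poly_mul(f, g):
    """Multiply using c_k = sum of a_i * b_j over all i + j = k."""
    if not f or not g:
        return []
    c = [Fraction(0)] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            c[i + j] += a * b
    return c

f = [Fraction(1), Fraction(2)]                # 1 + 2x
g = [Fraction(3), Fraction(0), Fraction(1)]   # 3 + x^2

print(poly_add(f, g))   # coefficients of 4 + 2x + x^2
print(poly_mul(f, g))   # coefficients of 3 + 6x + x^2 + 2x^3
```

The double loop in `poly_mul` is just "expanding brackets": every $a_i x^i$ meets every $b_j x^j$ once and contributes to the coefficient of $x^{i+j}$.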

Exercise 8 Convince yourself that this multiplication is really just the "expanding brackets" multiplication of polynomials that 
you know so well! 

(2.3) The polynomials $F[x]$ together with this addition form an example of a,

Definition. A group is a set $G$ endowed with an operation $\oplus$ such that for all $a, b, c \in G$,

1. $a \oplus b$ is a uniquely defined element of $G$ (closure);

2. $a \oplus (b \oplus c) = (a \oplus b) \oplus c$ (associativity);

3. there is an $e \in G$ such that $e \oplus a = a = a \oplus e$ (identity);

4. for any $a \in G$ there is an $a^{-1} \in G$ with $a \oplus a^{-1} = e = a^{-1} \oplus a$ (inverses).

A group that also satisfies $a \oplus b = b \oplus a$ for all $a, b \in G$ (commutativity) is said to be Abelian.

With polynomials, the operation $\oplus$ is just the regular addition of polynomials. When the group operation
is "familiar" addition it is customary to use the symbols: $+$ for $\oplus$; $0$ for $e$ and $-$ for inverses. Thus
the identity of $F[x]$ as a group is the zero polynomial and inverses are given by

$$-\left(\sum_{i=0}^{n} a_i x^i\right) = \sum_{i=0}^{n} (-a_i) x^i.$$

It's also easy to see that $F[x]$ forms an Abelian group: for, $f + g = g + f$ exactly when $a_i + b_i = b_i + a_i$ for
all $i$. But the coefficients of our polynomials come from the field $F$, and addition is always commutative
in a field.



³ In one of those triumphs of notation over intuition for which Mathematics is justifiably famous, define $\deg(0) = -\infty$, whereas
$\deg(\lambda) = 0$ if $\lambda \in F$ is not zero. The arithmetic of degrees is then just the arithmetic of non-negative integers, except we also
need to decree that $-\infty + n = -\infty$.

(2.4) If we want to think about multiplication as well, we need the formal concept of,

Definition. A ring is a set $R$ endowed with two operations $\oplus$ and $\otimes$ such that for all $a, b, c \in R$,

1. $R$ is an Abelian group under $\oplus$;

2. for any $a, b \in R$, $a \otimes b$ is a uniquely determined element of $R$ (closure of $\otimes$);

3. $a \otimes (b \otimes c) = (a \otimes b) \otimes c$ (associativity of $\otimes$);

4. there is an $f \in R$ such that $f \otimes a = a = a \otimes f$ (identity of $\otimes$);

5. $a \otimes (b \oplus c) = (a \otimes b) \oplus (a \otimes c)$ (the distributive law).

Loosely, a ring is a set on which you can add ($\oplus$), subtract (the inverse of $\oplus$ in the Abelian group) and
multiply ($\otimes$), but not necessarily divide (there is no inverse axiom for $\otimes$).
Here are some well known examples of rings: 

$$\mathbb{Z},\quad F[x] \text{ for } F \text{ a field},\quad \mathbb{Z}_n \quad\text{and}\quad M_n(F),$$

where $\mathbb{Z}_n$ is addition and multiplication of integers modulo $n$ and $M_n(F)$ are the $n \times n$ matrices, with
entries from $F$, together with the usual addition and multiplication of matrices.

A ring is commutative if the second operation $\otimes$ is commutative: $a \otimes b = b \otimes a$ for all $a, b$.

Exercise 9

1. Show that $fg = gf$ for polynomials $f, g \in F[x]$, hence $F[x]$ is a commutative ring.

2. Show that $\mathbb{Z}$ and $\mathbb{Z}_n$ are commutative rings, but $M_n(F)$ is not for any field $F$ if $n \geq 2$.

(2.5) The observation that $\mathbb{Z}$ and $F[x]$ are both commutative rings is not just some vacuous formalism.
A concrete way of putting it is this: at a very fundamental level, integers and polynomials share the same
algebraic properties.

When we work with polynomials, we need to be able to add and multiply the coefficients of the
polynomials in a way that doesn't produce any nasty surprises; in other words, the coefficients have to satisfy
the basic rules of algebra that we all know and love. But these basic rules of algebra can be found among
the axioms of a ring. Thus, to work with polynomials successfully, all we need is that the coefficients
come from a ring.

This observation means that for a ring $R$, we can form the set of all polynomials with coefficients from
$R$ and add and multiply them together as we did above. In fact, we are just repeating what we did above,
but are replacing the field $F$ with a ring $R$. In practice, rather than allowing our coefficients to come from
an arbitrary ring, we take $R$ to be commutative. Since we are so used to our coefficients commuting with
each other, this is probably a prudent precaution. This all leads to,

Definition. Denote by R[x] the set of all polynomials with coefficients from some commutative ring R, 
together with the + and x defined at (1). 

Exercise 10

1. Show that $R[x]$ forms a ring.

2. Since $R[x]$ forms a ring, we can consider polynomials with coefficients from $R[x]$: take a new variable, say $y$, and consider
$R[x][y]$. Show that this is just the set of polynomials in two variables $x$ and $y$ together with the 'obvious' $+$ and $\times$.

(2.6) A commutative ring $R$ is called an integral domain iff for any $a, b \in R$ with $a \otimes b = 0$, we have
$a = 0$, or $b = 0$ or both. Clearly $\mathbb{Z}$ is an integral domain.

Exercise 11

1. Show that any field $F$ is an integral domain.

2. For what values of $n$ is $\mathbb{Z}_n$ an integral domain?

Lemma 1 Let $f, g \in R[x]$, $R$ an integral domain. Then
1. $\deg(fg) = \deg(f) + \deg(g)$.
2. $R[x]$ is an integral domain.

The second part means that given polynomials $f$ and $g$ (with coefficients from an integral domain), we
have $fg = 0 \Rightarrow f = 0$ or $g = 0$. You have been implicitly using this fact for some time now when you
solve polynomial equations by factorising them.

Proof: Take $f$ and $g$ non-zero, of degrees $n$ and $m$ respectively (if either is zero, both sides of (1) are $-\infty$). We have

$$fg = \sum_{k=0}^{m+n} c_k x^k \ \text{ where }\ c_k = \sum_{i+j=k} a_i b_j,$$

so in particular $c_{m+n} = a_n b_m \neq 0$ as $R$ is an integral domain. Thus $\deg(fg) \geq m + n$ and since the
reverse inequality is obvious, we have part (1) of the Lemma. Part (2) now follows immediately since
$fg = 0 \Rightarrow \deg(fg) = -\infty \Rightarrow \deg f + \deg g = -\infty$, which can only happen if at least one of $f$ or $g$ has
degree $= -\infty$ (see the footnote at the bottom of the first page). □

All your life you have been happily adding the degrees of polynomials when you multiply them. But as
the result above shows, this is only possible when the coefficients of the polynomial come from an integral
domain. For example, $\mathbb{Z}_6$, the integers under addition and multiplication modulo 6, is a ring that is not an
integral domain (as $2 \times 3 = 0$ for example), and sure enough,

$$(3x + 1)(2x + 1) = 5x + 1,$$

where all of this is happening in $\mathbb{Z}_6[x]$.
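This failure of the degree formula is easy to reproduce. The sketch below multiplies coefficient lists $[a_0, a_1, \ldots]$ with all arithmetic done modulo $n$; the function name `poly_mul_mod` is an illustrative assumption.

```python
def poly_mul_mod(f, g, n):
    """Multiply coefficient lists [a_0, a_1, ...] with arithmetic modulo n."""
    c = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            c[i + j] = (c[i + j] + a * b) % n
    while c and c[-1] == 0:      # drop zero leading coefficients
        c.pop()
    return c

product = poly_mul_mod([1, 3], [1, 2], 6)   # (3x + 1)(2x + 1) in Z_6[x]
print(product)             # [1, 5], i.e. 5x + 1
print(len(product) - 1)    # 1: the degree is 1, not 1 + 1 = 2
```

Running the same product modulo 7 gives `[1, 5, 6]`, of degree 2 as expected, since there the leading coefficients 3 and 2 do not multiply to zero.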

(2.7) Although we cannot necessarily divide two polynomials and get another polynomial, we can divide
up to a possible "error term", or, as it is more commonly called, a remainder.

Theorem A (The division algorithm). Suppose $f$ and $g$ are elements of $R[x]$ where the leading
coefficient of $g$ has a multiplicative inverse in the ring $R$. Then there exist $q$ and $r$ in $R[x]$ (quotient and
remainder) such that

$$f = qg + r,$$

where either $r = 0$ or the degree of $r$ is $<$ the degree of $g$.

When $R$ is a field (where you may be more used to doing long division) all the non-zero coefficients
of a polynomial have multiplicative inverses (as they lie in a field) so the condition on $g$ becomes $g \neq 0$.

Actually the name of the theorem is not very apt: it merely guarantees the existence of a quotient and
remainder. It doesn't give us any idea how to find them (in other words, an algorithm). Compare the
theorem with what you know about $\mathbb{Z}$. There, we can also divide to get a remainder: when you divide 17
by 3, it goes 5 times with remainder 2; in other words, $17 = 5 \times 3 + 2$. With integers, we are used to
the remainder being smaller than the integer we are dividing by; in $R[x]$ this condition is replaced by the
degree of the remainder being strictly smaller than the degree of the divisor.

Proof: For all $q \in R[x]$, consider those polynomials of the form $f - gq$ and choose one, say $r$, of
smallest degree. Let $d = \deg r$ and $m = \deg g$. We claim that $d < m$. This will give the result, as the $r$
chosen has the form $r = f - gq$ for some $q$, giving $f = gq + r$. Suppose that $d \geq m$ and consider

$$\bar{r} = (r_d)(g_m^{-1})x^{d-m}g,$$

a polynomial since $d - m \geq 0$. Notice also that we have used the fact that the leading coefficient $g_m$ of $g$ has
a multiplicative inverse. The leading term of $\bar{r}$ is $r_d x^d$, which is also the leading term of $r$. Thus, $r - \bar{r}$
has degree $< d$. But $r - \bar{r} = f - gq - r_d g_m^{-1} x^{d-m} g$ by definition, which equals $f - g(q + r_d g_m^{-1} x^{d-m}) =
f - g\bar{q}$, say. Thus $r - \bar{r}$ has the form $f - g\bar{q}$ too, but with smaller degree than $r$, which was of minimal
degree amongst all polynomials of this form: this is our desired contradiction. □
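The proof only shows that $q$ and $r$ exist, but the step that produces $\bar{r}$ is exactly one step of long division, and repeating it gives an algorithm. Here is a minimal sketch over the field $\mathbb{Q}$, with polynomials stored as coefficient lists $[a_0, a_1, \ldots]$; the function name and representation are assumptions made for the illustration.

```python
from fractions import Fraction

def poly_divmod(f, g):
    """Return (q, r) with f = q*g + r and deg r < deg g, over Q.

    Polynomials are coefficient lists [a_0, a_1, ...] with non-zero last
    entry; the zero polynomial is the empty list.
    """
    if not g:
        raise ZeroDivisionError("division by the zero polynomial")
    r = [Fraction(a) for a in f]
    q = [Fraction(0)] * max(len(f) - len(g) + 1, 0)
    lead_inv = 1 / Fraction(g[-1])        # inverse of the leading coefficient of g
    while len(r) >= len(g):
        shift = len(r) - len(g)           # d - m in the proof
        coeff = r[-1] * lead_inv          # r_d * g_m^{-1}
        q[shift] += coeff
        for i, b in enumerate(g):         # subtract coeff * x^(d-m) * g from r
            r[i + shift] -= coeff * b
        while r and r[-1] == 0:           # strip zero leading coefficients
            r.pop()
    return q, r

# Divide x^3 - 2 by x - 1: quotient x^2 + x + 1, remainder -1.
q, r = poly_divmod([-2, 0, 0, 1], [-1, 1])
print(q, r)
```

Each pass through the loop cancels the leading term of the current remainder, so its degree drops strictly and the loop must stop.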
Exercise 12

1. If $R$ is an integral domain, show that the quotient and remainder are unique.

2. Show that the quotient and remainder are not unique when you divide polynomials in $\mathbb{Z}_6[x]$.

(2.8) Other familiar concepts from $\mathbb{Z}$ are those of divisors, common divisors and greatest common
divisors. Since we need no more algebra to define these notions than is enshrined in the axioms for a ring, it
should come as no surprise that these concepts carry pretty much straight over to polynomial rings. We
will state these in the setting of polynomials from $F[x]$ for $F$ a field.

Definition. For $f, g \in F[x]$, we say that $f$ divides $g$ iff $g = fh$ for some $h \in F[x]$. Write $f \mid g$.

Definition. Let $f, g \in F[x]$. Suppose that $d$ is a polynomial satisfying

1. $d$ is a common divisor of $f$ and $g$, ie: $d \mid f$ and $d \mid g$;

2. $d$ is the greatest common divisor in the sense that any other common divisor must divide $d$ (and so
in particular be smaller!), ie: if $c \mid f$ and $c \mid g$ then $c \mid d$;

3. $d$ is monic.

Then $d$ is called the greatest common divisor of $f$ and $g$.

As with the division algorithm, we have tweaked the definition from Z to make it work in F[x]. The 
reason is that we want the gcd to be unique. In Z you ensure this by insisting that all gcd's are positive, 
otherwise, —3 would make a perfectly good gcd for 6 and 27; in F[x] we go for the monic condition 
(otherwise if d was a gcd of / and g, then 17 x d would be too). 

(2.9) $x^2 - 1$ and $2x^3 - 2x^2 - 4x \in \mathbb{Q}[x]$ have greatest common divisor $x + 1$: it is certainly a common
divisor as $x^2 - 1 = (x + 1)(x - 1)$ and $2x^3 - 2x^2 - 4x = 2x(x + 1)(x - 2)$. From the two factorisations,
any other common divisor must be a non-zero constant $\lambda$ or have the form $\lambda(x + 1)$ for some $\lambda \in \mathbb{Q}$, and so divides $x + 1$.

(2.10)

Theorem 1 Any two $f, g \in F[x]$ have a greatest common divisor $d$. Moreover, there are $a_0, b_0 \in F[x]$
such that

$$d = a_0 f + b_0 g.$$

Compare this with $\mathbb{Z}$! In fact, one may replace $F[x]$ by $\mathbb{Z}$ in the following proof to obtain the
corresponding fact for the integers.

Proof: Consider the set $I = \{af + bg \mid a, b \in F[x]\}$. Let $d \in I$ be a monic polynomial with minimal
degree. Then $d \in I$ gives that $d = a_0 f + b_0 g$ for some $a_0, b_0 \in F[x]$. We claim that $d$ is the gcd of $f$ and
$g$. The following two basic facts are easy to verify:

1. The set $I$ is a subgroup of the Abelian group $F[x]$ (exercise).

2. If $u \in I$ and $w \in F[x]$ then $uw \in I$, since $wu = w(af + bg) = (wa)f + (wb)g \in I$.

Consider now the set $P = \{hd \mid h \in F[x]\}$. Since $d \in I$ and by the second observation above, $hd \in I$,
and we have $P \subseteq I$. Conversely, if $u \in I$ then by the division algorithm, $u = qd + r$ where $r = 0$ or
$\deg(r) < \deg(d)$. Now, $r = u - qd$ and $d \in I$, so $qd \in I$ by (2). But $u \in I$ and $qd \in I$ so $u - qd = r \in I$
by (1) above. Thus, if $\deg(r) < \deg(d)$ we would have a contradiction to the degree of $d$ being minimal,
and so we must have $r = 0$, giving $u = qd$. This means that any element of $I$ is a multiple of $d$, so $I \subseteq P$.

Now that we know that $I$ is just the set of all multiples of $d$, and since letting $a = 1, b = 0$ or
$a = 0, b = 1$ gives that $f, g \in I$, we have that $d$ is a common divisor of $f$ and $g$. Finally, if $d'$ is another
common divisor, then $f = u_1 d'$ and $g = u_2 d'$, and since $d = a_0 f + b_0 g$, we have $d = a_0 u_1 d' + b_0 u_2 d' =
d'(a_0 u_1 + b_0 u_2)$ giving $d' \mid d$. Thus $d$ is indeed the greatest common divisor. □
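The proof above is an existence argument. To actually compute $d$, $a_0$ and $b_0$ one can use the extended Euclidean algorithm, which is a different, constructive route and not the method of the proof. The sketch below works over $\mathbb{Q}$ with coefficient lists $[a_0, a_1, \ldots]$, assumes $f$ and $g$ are not both zero, and uses helper names that are purely illustrative.

```python
from fractions import Fraction

def trim(p):
    """Remove zero leading coefficients; the zero polynomial is []."""
    while p and p[-1] == 0:
        p = p[:-1]
    return p

def add(f, g):
    n = max(len(f), len(g))
    f = f + [Fraction(0)] * (n - len(f))
    g = g + [Fraction(0)] * (n - len(g))
    return trim([a + b for a, b in zip(f, g)])

def mul(f, g):
    if not f or not g:
        return []
    c = [Fraction(0)] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            c[i + j] += a * b
    return trim(c)

def divmod_poly(f, g):
    """Long division: f = q*g + r with deg r < deg g (g non-zero)."""
    q, r = [], trim(list(f))
    while len(r) >= len(g):
        term = [Fraction(0)] * (len(r) - len(g)) + [r[-1] / g[-1]]
        q = add(q, term)
        r = add(r, mul([-c for c in term], g))
    return q, r

def ext_gcd(f, g):
    """Return (d, a0, b0) with d monic and d = a0*f + b0*g."""
    r0 = trim([Fraction(c) for c in f])
    r1 = trim([Fraction(c) for c in g])
    a0, a1 = [Fraction(1)], []       # invariant: r0 = a0*f + b0*g
    b0, b1 = [], [Fraction(1)]       # invariant: r1 = a1*f + b1*g
    while r1:
        q, r = divmod_poly(r0, r1)
        neg_q = [-c for c in q]
        r0, r1 = r1, r
        a0, a1 = a1, add(a0, mul(neg_q, a1))
        b0, b1 = b1, add(b0, mul(neg_q, b1))
    scale = [1 / r0[-1]]             # rescale so that d is monic
    return mul(scale, r0), mul(scale, a0), mul(scale, b0)

f = [-1, 0, 1]          # x^2 - 1
g = [0, -4, -2, 2]      # 2x^3 - 2x^2 - 4x
d, a0, b0 = ext_gcd(f, g)
print(d, a0, b0)        # d = x + 1, a0 = x - 1, b0 = -1/2
```

For the pair from (2.9) this recovers $d = x + 1$ together with $x + 1 = (x - 1)(x^2 - 1) - \frac{1}{2}(2x^3 - 2x^2 - 4x)$.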
(2.11) We have one more thing to say about polynomial rings. First, we need to recall a fundamental
notion:



Definition. Let $R$ and $S$ be rings. A mapping $\varphi : R \to S$ is called a ring homomorphism if and only if
for all $a, b \in R$,

1. $\varphi(a + b) = \varphi(a) + \varphi(b)$;

2. $\varphi(ab) = \varphi(a)\varphi(b)$;

3. $\varphi(f) = f$ (where $f$ denotes the multiplicative identity, of $R$ on the left and of $S$ on the right).

In any ring of interest to us, the last item translates as $\varphi(1) = 1$. Why do we need this but not $\varphi(0) = 0$?
Actually it's quite simple: we have $\varphi(0) = \varphi(0 + 0) = \varphi(0) + \varphi(0)$ and since $S$ is an Abelian group
under addition, we can cancel (we are using the existence of inverses under addition!) to get $\varphi(0) = 0$.
We can't do this to get $\varphi(1) = 1$ as we don't have inverses under multiplication, so we need to enshrine
the desired property in the definition.

You should think of a homomorphism as being like an "algebraic analogy", or a way of transferring
algebraic properties; the algebra in the image of $\varphi$ is analogous to the algebra of $R$.

(2.12) We will have much more to say about general homomorphisms later on. For now, let's look at
one in particular. Let $R[x]$ be a ring of polynomials over a commutative ring $R$, and let $\lambda \in R$. Define a
mapping $\varepsilon_\lambda : R[x] \to R$ by

$$\varepsilon_\lambda(f) = f(\lambda) = a_0 + a_1\lambda + \cdots + a_n\lambda^n,$$

ie: substitute $\lambda$ into $f$. This is a ring homomorphism from $R[x]$ to $R$, called the evaluation at $\lambda$
homomorphism: to see this, certainly $\varepsilon_\lambda(1) = 1$, and I'll leave $\varepsilon_\lambda(f + g) = \varepsilon_\lambda(f) + \varepsilon_\lambda(g)$ to you as it's not
hard. Now,

$$\varepsilon_\lambda(fg) = \varepsilon_\lambda\left(\sum_{k=0}^{m+n} c_k x^k\right) = \sum_{k=0}^{m+n} c_k\lambda^k \ \text{ where }\ c_k = \sum_{i+j=k} a_i b_j.$$

But $\sum_{k=0}^{m+n} c_k\lambda^k = \left(\sum_{i=0}^{n} a_i\lambda^i\right)\left(\sum_{j=0}^{m} b_j\lambda^j\right) = \varepsilon_\lambda(f)\varepsilon_\lambda(g)$ and we are done.

One consequence of $\varepsilon_\lambda$ being a homomorphism is that given a factorisation of a polynomial, say
$f = gh$, we have $\varepsilon_\lambda(f) = \varepsilon_\lambda(g)\varepsilon_\lambda(h)$, ie: if we substitute $\lambda$ into $f$ we get the same answer as when
we substitute into $g$ and $h$ and multiply the answers. This is another fact that appears to be trivial at first
sight; you would have instinctively done this anyway no doubt.
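The multiplicative property can be spot-checked on an example. The sketch below works over $\mathbb{Q}$ with coefficient lists $[a_0, a_1, \ldots]$; the names `evaluate` and `poly_mul` and the particular polynomials are assumptions chosen for the illustration, and a single check is of course no substitute for the argument above.

```python
from fractions import Fraction

def evaluate(f, lam):
    """Substitute lam into the coefficient list [a_0, a_1, ...] (Horner's rule)."""
    result = Fraction(0)
    for a in reversed(f):
        result = result * lam + a
    return result

def poly_mul(g, h):
    """Multiply using c_k = sum of a_i * b_j over all i + j = k."""
    c = [Fraction(0)] * (len(g) + len(h) - 1)
    for i, a in enumerate(g):
        for j, b in enumerate(h):
            c[i + j] += a * b
    return c

g = [1, 1]              # x + 1
h = [-2, 1]             # x - 2
f = poly_mul(g, h)      # x^2 - x - 2
lam = Fraction(3, 2)

print(evaluate(f, lam))                      # -5/4
print(evaluate(g, lam) * evaluate(h, lam))   # -5/4
```

Both lines print the same value: substituting into the product agrees with multiplying the two substitutions.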

Further Exercises for §2.

Exercise 13 Let $f, g$ be polynomials over the field $F$ and $f = gh$. Show that $h$ is also a polynomial over $F$.

Exercise 14 Let $\sigma : R \to S$ be a homomorphism of rings. Define $\sigma^* : R[x] \to S[x]$ by

$$\sigma^* : \sum_i a_i x^i \mapsto \sum_i \sigma(a_i) x^i.$$

Show that $\sigma^*$ is a homomorphism.

Exercise 15 Let $R$ be a ring and define $\partial : R[x] \to R[x]$ by

$$\partial : \sum_{k=0}^{n} a_k x^k \mapsto \sum_{k=1}^{n} (k a_k) x^{k-1} \quad\text{and}\quad \partial(\lambda) = 0,$$

for any constant $\lambda$. (Ring a bell?) Show that $\partial(f + g) = \partial(f) + \partial(g)$ and $\partial(fg) = \partial(f)g + f\partial(g)$. The map $\partial$ is called the
formal derivative.

§3. Roots and Irreducibility

(3.1) Much of the material in this section is familiar in the setting of polynomials with $\mathbb{R}$ coefficients.
The point is that these results are still true for polynomials with coefficients coming from an arbitrary
field $F$, and quite often, for polynomials with coefficients from a ring $R$.
Let

$$f = a_0 + a_1 x + \cdots + a_n x^n$$

be a polynomial in $R[x]$ for $R$ a ring. We say that $\lambda \in R$ is a root of $f$ if

$$f(\lambda) = a_0 + a_1\lambda + \cdots + a_n\lambda^n = 0 \text{ in } R.$$

As a trivial example, the polynomial $x^2 + 1$ is in all three rings $\mathbb{Q}[x]$, $\mathbb{R}[x]$ and $\mathbb{C}[x]$. It has no roots in
either $\mathbb{Q}$ or $\mathbb{R}$, but two in $\mathbb{C}$.

Aside. "I thought that we weren't thinking of $x$ as a variable!", I hear you say. In fact we don't need to, as long as we are prepared
to think a little more abstractly about something we have been happily doing intuitively for a while now. Here is how: we say that
$\lambda$ is a root of $f$ if and only if there is a homomorphism $\varphi : R[x] \to R$ such that $\varphi$ restricts to the identity on $R$, ie: $\varphi(a) = a$ for
all $a \in R$, and also that $\varphi(x) = \lambda$ and $\varphi(f) = 0$. In fact you see that the homomorphism needed is the evaluation homomorphism $\varepsilon_\lambda$.

(3.2)

The Factor Theorem. An element $\lambda \in R$ is a root of $f$ if and only if $f = (x - \lambda)g$ for some $g \in R[x]$.
In English, $\lambda$ is a root exactly when $x - \lambda$ is a factor.

Proof: This is an illustration of the power of the division algorithm, Theorem A. Suppose that $f$ has the
form $(x - \lambda)g$ for some $g \in R[x]$. Then

$$f(\lambda) = (\lambda - \lambda)\,g(\lambda) = 0 \cdot g(\lambda) = 0,$$

so that $\lambda$ is indeed a root (notice we used that $\varepsilon_\lambda$ is a homomorphism, ie: that $\varepsilon_\lambda(f) = \varepsilon_\lambda(x - \lambda)\varepsilon_\lambda(g)$).
On the other hand, by the division algorithm, we can divide $f$ by the polynomial $x - \lambda$ to get,

$$f = (x - \lambda)g + \mu,$$

where $\mu \in R$ (we can use the division algorithm, as the leading coefficient of $x - \lambda$, being 1, has an
inverse in $R$). Since $f(\lambda) = 0$, we must also have $(\lambda - \lambda)g(\lambda) + \mu = 0$, hence $\mu = 0$. Thus $f = (x - \lambda)g$
as required. □

(3.3) Here is another result that you probably already know to be true for polynomials over the reals, 
complexes, etc. Reassuringly, it is true for polynomials with coefficients from (almost) any ring. 

Theorem 2 Let f <G R[x] be a non-zero polynomial with coefficients from the integral domain R. Then 
f has at most deg(/) roots in R. 

Proof: We use induction on the degree, which is $\geq 0$ since $f$ is non-zero. If $\deg(f) = 0$ then $f = \mu$, a
nonzero constant in $R$, which clearly has no roots, so the result holds. Assume $\deg(f) \geq 1$ and that the
result is true for any polynomial of degree $< \deg(f)$. If $f$ has no roots in $R$ then we are done. Otherwise,
$f$ has a root $\lambda \in R$ and

$f = (x - \lambda)g,$

for some $g \in R[x]$ by the factor theorem. Moreover, as $R$ is an integral domain, $f(\mu) = 0$ iff either
$\mu - \lambda = 0$ or $g(\mu) = 0$, so the roots of $f$ are $\lambda$, together with the roots of $g$. Since the degree of $g$ must
be $\deg(f) - 1$ (by Lemma 1, again using the fact that $R$ is an integral domain), it has at most $\deg(f) - 1$
roots by the inductive hypothesis, and these combined with $\lambda$ give at most $\deg(f)$ roots for $f$. □
(3.4) As the theorem indicates, a cherished fact such as this might not be true if the coefficients of our
polynomial do not come from an integral domain. For instance, if $R = \mathbb{Z}_6$, then the quadratic polynomial

$(x - 1)(x - 2) = x^2 + 3x + 2$ has roots 1, 2, 4 and 5 in $\mathbb{Z}_6$.
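The four roots are easy to confirm by brute force. Here is a small Python sketch (our own illustration; the helper name `roots_mod` and the low-to-high coefficient ordering are assumptions, not notation from the notes) that tries every element of the integers modulo $n$ in turn.

```python
def roots_mod(coeffs, n):
    """Return every r in {0, ..., n-1} with f(r) = 0 modulo n.

    coeffs lists the coefficients of f from the constant term upwards.
    """
    return [r for r in range(n)
            if sum(c * r**k for k, c in enumerate(coeffs)) % n == 0]


print(roots_mod([2, 3, 1], 6))   # x^2 + 3x + 2 over the integers mod 6
print(roots_mod([2, 3, 1], 7))   # the same coefficients over the field of integers mod 7
```

Modulo 6 the quadratic has four roots, whereas modulo the prime 7 (where the integers form a field, hence an integral domain) it has only two, in line with Theorem 2.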

(3.5) Notice that when we say that $f \in R[x]$, all we are claiming is that the ring $R$ is big enough to contain
the coefficients of $f$. So $x^2 + 1$ is equally at home in $\mathbb{Q}[x]$, $\mathbb{R}[x]$ and $\mathbb{C}[x]$ (not to mention $\mathbb{Q}(i)[x]$ ...).

This observation and the theorem mean that a polynomial has at most its degree number of roots in any
integral domain that contains its coefficients. Put another way, we may become comfortable with the idea of creating
"new" numbers to solve equations (for example, the creation of $\mathbb{C}$ to solve $x^2 + 1 = 0$), but there will
always be a limit to our inventiveness: you will never find more than two solutions to $x^2 + 1 = 0$, no
matter how many "new numbers" you make up.

Exercise 16 A polynomial like $x^2 + 2x + 1 = (x + 1)^2$ has $-1$ as a repeated root. Its derivative, in the sense of elementary
calculus, is $2(x + 1)$, which also has $-1$ as a root. In general, and in light of the Factor Theorem, call $\lambda \in F$ a repeated root of $f$ iff
$f = (x - \lambda)^k g$ for some $k > 1$.

1. Using the formal derivative $\partial$ (see Exercise 15), show that a root $\lambda$ of $f$ is a repeated root if and only if $\lambda$ is also a root of $\partial(f)$.

2. Show that $f$ has no repeated roots, ie: the roots of $f$ are distinct, if and only if $\gcd(f, \partial(f)) = 1$.

(3.6) For reasons that will become clearer later, a very important role is played by polynomials that cannot 
be "factorised". 

Definition. Let $F$ be a field and $f \in F[x]$ a non-constant polynomial. A non-trivial factorisation of $f$ is
an expression of the form $f = gh$, where $g, h \in F[x]$ and $\deg g, \deg h \geq 1$. Say $f$ is reducible over $F$ iff
it has a non-trivial factorisation, and irreducible over $F$ otherwise.

Thus, a polynomial over a field $F$ is irreducible precisely when it cannot be written as a product of
non-constant polynomials. Another way of putting it is to say that $f \in F[x]$ is irreducible precisely when
it is divisible only by a constant $\mu$, or $\mu \times f$.

Aside. You can also talk about polynomials being irreducible over a ring (eg: over $\mathbb{Z}$). The definition is slightly more complicated
however: let $f \in R[x]$ be a non-constant polynomial with coefficients from the ring $R$. A non-trivial factorisation of $f$ is an expression
of the form $f = gh$, where $g, h \in R[x]$ and either,

1. $\deg g, \deg h \geq 1$, or

2. if say $g = \lambda \in R$ is a constant, then $\lambda$ has no multiplicative inverse in $R$.

Say $f$ is reducible over $R$ iff it has a non-trivial factorisation, and irreducible over $R$ otherwise. Notice that if $R = F$ a field, then
the second possibility never arises, as every non-zero element of $F$ has a multiplicative inverse.

The reason for the extra complication in the definition is that if $\lambda \in R$ is a constant which does have a multiplicative inverse in
$R$, then you can always write

$f = \lambda(\lambda^{-1} f).$

So pulling out such constants is too easy! As an example, $3x + 3 = 3(x + 1)$ is a non-trivial factorisation in $\mathbb{Z}[x]$ but a trivial one
in $\mathbb{Q}[x]$.

The "over $F$" that follows reducible or irreducible is crucial; polynomials are never absolutely
reducible or irreducible in any sense. An obvious example is $x^2 + 1$, which is irreducible over $\mathbb{R}$ but
reducible over $\mathbb{C}$.

(3.7) There is an exception to this, and it is that a linear polynomial $f = ax + b \in F[x]$ is irreducible
over any field $F$: if $f = gh$ then since $\deg f = 1$, we cannot have both $\deg(g), \deg(h) \geq 1$, for then
$\deg(gh) = \deg(g) + \deg(h) \geq 1 + 1 = 2$, a contradiction. Thus, one of $g$ or $h$ must be a constant with $f$
thus irreducible over $F$. So maybe we can qualify the statement above: linear polynomials are absolutely
irreducible (we don't need to mention the field), but that's it!

Exercise 17 

1. Let $F$ be a field and $\lambda \in F$. Show that $f$ is an irreducible polynomial over $F$ if and only if $\lambda f$ is irreducible over $F$ for any
$\lambda \neq 0$.

2. Show that if $f(x + \lambda)$ is irreducible over $F$ then $f(x)$ is too.
(3.8) There is the famous,

Fundamental Theorem of Algebra. Any non-constant $f \in \mathbb{C}[x]$ has a root in $\mathbb{C}$.

So if $f \in \mathbb{C}[x]$ has $\deg f \geq 2$, then $f$ has a root in $\mathbb{C}$, hence a linear factor over $\mathbb{C}$, hence is reducible
over $\mathbb{C}$. Thus, the only irreducible polynomials over $\mathbb{C}$ are the linear ones.

Aside. Actually, the fundamental theorem of algebra has been described as neither fundamental nor about algebra! Later we will 
be able to prove it from something known as the Galois correspondence, which also happens to be called the Fundamental Theorem 
of Galois Theory. Now, if you take the view that Galois theory is a subset of algebra, then it does seem rather odd that a theorem 
supposedly fundamental to all of algebra can be proved from a theorem that is merely fundamental to a part of it. 

Exercise 18 Show that if $f$ is irreducible over $\mathbb{R}$ then $f$ is either linear or quadratic.

(3.9) A very common mistake is to think that having no roots in F is the same thing as being irreducible 
over F. In fact, the two are not even remotely the same thing. 

Just because a polynomial is irreducible over $F$ does not mean that it has no roots in the field: we saw
above that a linear polynomial $ax + b$ is always irreducible, and yet has a root in $F$, namely $-b/a$. It is
true though that if a polynomial $f$ has degree $\geq 2$ and had a root in $F$, then by the factor theorem it would
have a linear factor so would be reducible. Thus, if $\deg(f) \geq 2$ and $f$ is irreducible over $F$, then $f$ has
no roots in $F$.

A polynomial that has no roots in $F$ is not necessarily irreducible over the field: the polynomial
$x^4 + 2x^2 + 1 = (x^2 + 1)^2$ is reducible over $\mathbb{Q}$, but with roots $\pm i \notin \mathbb{Q}$.

(3.10) There is no general method for deciding if a polynomial over an arbitrary field F is irreducible: 
the situation is not dissimilar to that of integration in calculus. There is no list of rules that collectively 
apply to all situations. The best we can hope for is an ever expanding list of techniques, of which this is 
the first: 

Proposition 1 Let $F$ be a field and $f \in F[x]$ be a polynomial of degree $\leq 3$. If $f$ has no roots in $F$ then
it is irreducible over $F$.

Proof: Arguing by contradiction, if $f$ is reducible then $f = gh$ with $\deg g, \deg h \geq 1$. Since $\deg g + \deg h = \deg f \leq 3$,
we must have for $g$ say, that $\deg g = 1$. Thus $f = (ax + b)h$ and $f$ has the root $-b \times a^{-1}$. □

Exercise 19 We need a new field to play with. Let $p$ be a prime and $\mathbb{F}_p$ the set $\{0, 1, \ldots, p - 1\}$. Define addition and multiplication
on this set to be addition and multiplication of integers modulo $p$.

1. Verify that $\mathbb{F}_p$ is a field by checking the axioms. The only tricky one is the existence of inverses under multiplication: use
the gcd theorem from §2. (but for $\mathbb{Z}$ rather than polynomials).

2. Show that a field $F$ is an integral domain. Hence, show that if $n$ is not prime, then the set $\{0, 1, \ldots, n - 1\}$ with addition and multiplication of integers
modulo $n$ is not a field.

(3.11) Consider polynomials with coefficients from, say, $\mathbb{F}_2$, ie: the ring $\mathbb{F}_2[x]$, and in particular, the
polynomial

$f = x^4 + x + 1 \in \mathbb{F}_2[x].$

Now, $0^4 + 0 + 1 \neq 0 \neq 1^4 + 1 + 1$, so $f$ has no roots in $\mathbb{F}_2$. Although this is a good start, we are in
no position to finish, as the Proposition above does not apply to quartics. But we can certainly say that
any factorisation of $f$ over $\mathbb{F}_2$, if there is one, must be as a product of two quadratics. Moreover, these
quadratics must themselves be irreducible over $\mathbb{F}_2$, for if not, they would factor into linear factors and the
factor theorem would give roots of $f$.
There are only four quadratics over $\mathbb{F}_2$:

$x^2$, $x^2 + 1$, $x^2 + x$ and $x^2 + x + 1$.

The first two are reducible as they have roots 0 and 1 respectively; the third is also reducible with both
0 and 1 as roots. By the Proposition above, the last is irreducible. Thus, any factorisation of $f$ into
irreducible quadratics must in fact be of the form,

$(x^2 + x + 1)(x^2 + x + 1).$

Unfortunately, $f$ doesn't factorise this way (just expand the brackets: over $\mathbb{F}_2$ the product is $x^4 + x^2 + 1$). Thus $f$ is irreducible over $\mathbb{F}_2$.
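The whole argument is small enough to check exhaustively by machine. The sketch below is our own illustration in Python (the helper names `poly_mul_mod` and `has_root_mod`, and the low-to-high coefficient lists, are assumptions made for the example). It confirms that the quartic has no roots modulo 2 and that no product of two monic quadratics with coefficients modulo 2 equals it.

```python
from itertools import product


def poly_mul_mod(f, g, p):
    """Multiply two polynomials (coefficient lists, constant term first) modulo p."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % p
    return out


def has_root_mod(f, p):
    """True if f(r) = 0 modulo p for some r in {0, ..., p-1}."""
    return any(sum(c * r**k for k, c in enumerate(f)) % p == 0 for r in range(p))


f = [1, 1, 0, 0, 1]                      # x^4 + x + 1
q = [1, 1, 1]                            # x^2 + x + 1

# All four monic quadratics with coefficients in {0, 1}.
quadratics = [[c0, c1, 1] for c0, c1 in product(range(2), repeat=2)]
factorisations = [(g, h) for g in quadratics for h in quadratics
                  if poly_mul_mod(g, h, 2) == f]

print(has_root_mod(f, 2))                # no linear factors
print(poly_mul_mod(q, q, 2))             # the square of x^2 + x + 1
print(factorisations)                    # products of two quadratics equal to f
```

The empty list of factorisations, together with the absence of roots, is the computational counterpart of the irreducibility argument above.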

(3.12) As we delve deeper into Galois theory, it will transpire that Q is where much of the action happens. 
Consequently, determining the irreducibility of polynomials over Q will be of great importance. The first 
useful test for irreducibility over Q has the following main ingredient: to see if a polynomial can be 
factorised over Q it suffices to see whether it can be factorised over Z. 

First we recall Exercise 14, which is used a number of times in these notes so is worth placing in a,

Lemma 2 Let $\sigma : R \to S$ be a homomorphism of rings. Define $\sigma^* : R[x] \to S[x]$ by

$\sigma^* : \sum_i a_i x^i \mapsto \sum_i \sigma(a_i) x^i.$

Then $\sigma^*$ is a homomorphism.

Lemma 3 (Gauss) Let f be a polynomial with integer coefficients. Then f can be factorised non-trivially 
as a product of polynomials with integer coefficients if and only if it can be factorised non-trivially as a 
product of polynomials with rational coefficients. 

Proof: If the polynomial can be written as a product of $\mathbb{Z}$-polynomials then it clearly can as a product
of $\mathbb{Q}$-polynomials as integers are rational! Suppose on the other hand that $f = gh$ in $\mathbb{Q}[x]$ is a non-trivial
factorisation. By multiplying through by a multiple of the denominators of the coefficients of $g$ we get a
polynomial $g_1 = mg$ with $\mathbb{Z}$-coefficients. Similarly we have $h_1 = nh \in \mathbb{Z}[x]$ and so

$mnf = g_1 h_1 \in \mathbb{Z}[x].$ (2)

Now let $p$ be a prime dividing $mn$, and consider the homomorphism $\sigma : \mathbb{Z} \to \mathbb{F}_p$ given by $\sigma(k) = k \bmod p$.
Then by the lemma above, the map $\sigma^* : \mathbb{Z}[x] \to \mathbb{F}_p[x]$ given by

$\sigma^* : \sum_i a_i x^i \mapsto \sum_i \sigma(a_i) x^i,$

is a homomorphism. Applying the homomorphism to (2) gives $0 = \sigma^*(g_1)\sigma^*(h_1)$ in $\mathbb{F}_p[x]$, as $mn \equiv 0 \bmod p$.
As the ring $\mathbb{F}_p[x]$ is an integral domain the only way that this can happen is if one of the
polynomials is equal to the zero polynomial in $\mathbb{F}_p[x]$, ie: one of the original polynomials, say $g_1$, has all
of its coefficients divisible by $p$. Thus we have $g_1 = p g_2$ with $g_2 \in \mathbb{Z}[x]$, and (2) becomes

$\frac{mn}{p} f = g_2 h_1.$

Working our way through all the prime factors of $mn$ in this way, we can remove the factor of $mn$ from
(2) and obtain a factorisation of $f$ into polynomials with $\mathbb{Z}$-coefficients. □

So to determine whether a polynomial with Z-coefficients is irreducible over Q, one need only check 
that it has no non-trivial factorisations with all the coefficients integers. 

Eisenstein Irreducibility Theorem. Let

$f = c_n x^n + \cdots + c_1 x + c_0,$

be a non-linear polynomial with integer coefficients. If there is a prime $p$ that divides all the $c_i$ for $i < n$,
does not divide $c_n$ and such that $p^2$ does not divide $c_0$, then $f$ is irreducible over $\mathbb{Q}$.
Proof: By virtue of the fact above, we need only show that under the conditions stated, there is no
factorisation of $f$ using integer coefficients. Suppose otherwise, ie: $f = gh$ with

$g = a_r x^r + \cdots + a_0$ and $h = b_s x^s + \cdots + b_0$

and the $a_i, b_i \in \mathbb{Z}$. Expanding $gh$ and equating coefficients,

$c_0 = a_0 b_0$
$c_1 = a_0 b_1 + a_1 b_0$
$\vdots$
$c_i = a_0 b_i + a_1 b_{i-1} + \cdots + a_i b_0$
$\vdots$
$c_n = a_r b_s.$

By hypothesis, $p \mid c_0$. Write both $a_0$ and $b_0$ as a product of primes, so if $p \mid c_0$, ie: $p \mid a_0 b_0$, then $p$ must
be one of the primes in this factorisation, hence divides one of $a_0$ or $b_0$. Thus, either $p \mid a_0$ or $p \mid b_0$, but
not both (for then $p^2$ would divide $c_0$). Assume that it is $p \mid a_0$ that we have. Next, $p \mid c_1$, and this coupled
with $p \mid a_0$ gives $p \mid c_1 - a_0 b_1 = a_1 b_0$ (If we had assumed $p \mid b_0$, we would still reach this conclusion).
Again, $p$ must divide one of these last two factors, and since we've already decided that it doesn't
divide $b_0$, it must be $a_1$ that it divides. Continuing in this manner, we get that $p$ divides all the coefficients
of $g$, and in particular, $a_r$. But then $p$ divides $a_r b_s = c_n$, the contradiction we were after. □

As a meta-mathematical comment, the proof of Eisenstein irreducibility is a nice example of the manner 
in which mathematics is created. You start with as few assumptions as possible (in this case that p divides 
some of the coefficients of /) and proceed towards some sort of conclusion, imposing extra conditions 
as and when you need them. In this way the correct statement of the theorem writes itself in an organic 
fashion. 

(3.13) To show the power of the result, we get immediately that 

$x^4 - 5x^3 + 10x^2 + 25x - 35,$
is irreducible over $\mathbb{Q}$, a fact not easily shown another way. Even more useful, we have that

$x^n - p$

is irreducible over $\mathbb{Q}$ for any prime $p$. Thus, we can find polynomials over $\mathbb{Q}$ of arbitrarily large degree that
are irreducible, which is to be contrasted strongly with the situation for polynomials over $\mathbb{R}$ or $\mathbb{C}$.
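Checking the hypotheses of Eisenstein's theorem is purely mechanical, so it is easy to automate. Here is a minimal Python sketch; it is our own illustration, and the names `prime_factors` and `eisenstein_primes`, as well as the choice to list coefficients from the constant term upwards, are assumptions made for the example. It returns every prime for which the three divisibility conditions hold.

```python
def prime_factors(n):
    """Distinct prime factors of a non-zero integer, by trial division."""
    n = abs(n)
    primes, d = [], 2
    while d * d <= n:
        if n % d == 0:
            primes.append(d)
            while n % d == 0:
                n //= d
        d += 1
    if n > 1:
        primes.append(n)
    return primes


def eisenstein_primes(coeffs):
    """Primes p satisfying Eisenstein's conditions for c_0 + c_1 x + ... + c_n x^n.

    coeffs = [c_0, c_1, ..., c_n]. An empty result means the test is
    inconclusive; it does NOT mean the polynomial is reducible.
    """
    c0, cn = coeffs[0], coeffs[-1]
    if c0 == 0:
        return []
    # Any suitable prime must divide c_0, so only those need to be tried.
    return [p for p in prime_factors(c0)
            if all(c % p == 0 for c in coeffs[:-1])   # p divides c_i for i < n
            and cn % p != 0                           # p does not divide c_n
            and c0 % (p * p) != 0]                    # p^2 does not divide c_0


print(eisenstein_primes([-35, 25, 10, -5, 1]))   # x^4 - 5x^3 + 10x^2 + 25x - 35
print(eisenstein_primes([1, 0, 1]))              # x^2 + 1: no prime works
```

Note the asymmetry: a non-empty answer proves irreducibility over the rationals, but an empty one proves nothing, as the second call shows for a polynomial that is in fact irreducible.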

(3.14) It turns out that there is a fundamental connection between the multitude of irreducible polynomials 
over $\mathbb{Q}$ (and the relative paucity of them over $\mathbb{R}$ and $\mathbb{C}$) and the empirical observation that there are lots
of fields a "little bigger" than $\mathbb{Q}$ (for example, $\mathbb{Q}(\sqrt{2})$ and $\mathbb{Q}(\alpha, \omega)$ from §1.), but very few fields a "little
bigger" than $\mathbb{R}$ or $\mathbb{C}$.

(3.15) Another useful tool arises when you have polynomials with coefficients from some ring R and a 
homomorphism from R to some field F. If the homomorphism is applied to all the coefficients of the 
polynomial (turning it from a polynomial with $R$-coefficients into a polynomial with $F$-coefficients), then
a reducible polynomial cannot turn into an irreducible one. The precise statement goes by the name of: 

The Reduction Test. Let $R$ be an integral domain, $F$ a field and $\sigma : R \to F$ a ring homomorphism.
Let $\sigma^* : R[x] \to F[x]$ given by

$\sigma^* : \sum_i a_i x^i \mapsto \sum_i \sigma(a_i) x^i,$

be the homomorphism of Lemma 2. Moreover, let $p \in R[x]$ be such that

1. $\deg \sigma^*(p) = \deg(p)$, and

2. $\sigma^*(p)$ is irreducible over $F$.

Then $p$ cannot be written as a product $gh$ with $g, h \in R[x]$ and $\deg g, \deg h < \deg p$.

Although it is stated in some generality, the reduction test is very useful for determining the irreducibility
of polynomials over $\mathbb{Q}$. As an example, take $R = \mathbb{Z}$, $F = \mathbb{F}_5$ and $p = 8x^3 - 6x - 1 \in \mathbb{Z}[x]$. For $\sigma$,
take reduction modulo 5, ie: $\sigma(n) = n \bmod 5$. It is not hard to show that $\sigma$ is a homomorphism. Since
$\sigma(8) = 3 \bmod 5$, and so on, we get

$\sigma^*(p) = 3x^3 + 4x + 4 \in \mathbb{F}_5[x].$

Clearly, the degree has not changed, and by substituting the five elements of $\mathbb{F}_5$ into $\sigma^*(p)$, one can see
that it has no roots in $\mathbb{F}_5$. Since the polynomial is a cubic, it must therefore be irreducible over $\mathbb{F}_5$. Thus,
by the reduction test, $8x^3 - 6x - 1$ cannot be written as a product of smaller degree polynomials with
$\mathbb{Z}$-coefficients. But by Gauss' lemma, this gives that this polynomial is irreducible over $\mathbb{Q}$.
$\mathbb{F}_5$ was chosen because with $\mathbb{F}_2$ instead, condition (1) fails; with $\mathbb{F}_3$, condition (2) fails.
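The three choices of prime in this example can be compared side by side with a few lines of Python. This is a sketch of our own (the helper names `reduce_mod` and `roots_mod` and the low-to-high coefficient lists are assumptions): it reduces the coefficients modulo 2, 3 and 5, trims any leading zeros so that a drop in degree is visible, and lists the roots of each reduced polynomial.

```python
def reduce_mod(coeffs, p):
    """Reduce each coefficient modulo p and trim leading zero coefficients.

    coeffs lists coefficients from the constant term upwards.
    """
    out = [c % p for c in coeffs]
    while out and out[-1] == 0:
        out.pop()
    return out


def roots_mod(coeffs, p):
    """All r in {0, ..., p-1} at which the polynomial vanishes modulo p."""
    return [r for r in range(p)
            if sum(c * r**k for k, c in enumerate(coeffs)) % p == 0]


cubic = [-1, -6, 0, 8]                     # 8x^3 - 6x - 1
for q in (2, 3, 5):
    image = reduce_mod(cubic, q)
    print(q, image, roots_mod(image, q))
```

Modulo 2 the image collapses to a constant, so the degree condition fails; modulo 3 the image keeps degree 3 but acquires a root, so it is reducible; modulo 5 the image is a cubic with no roots, which is the case the reduction test needs.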

Proof: Suppose on the contrary that $p = gh$ with $\deg g, \deg h < \deg p$. Then $\sigma^*(p) = \sigma^*(gh) = \sigma^*(g)\sigma^*(h)$,
the last part because $\sigma^*$ is a homomorphism. Now $\sigma^*(p)$ is irreducible, so the only way it
can factorise like this is if one of the factors, $\sigma^*(g)$ say, is a constant, hence $\deg \sigma^*(g) = 0$. Then

$\deg p = \deg \sigma^*(p) = \deg \sigma^*(g)\sigma^*(h) = \deg \sigma^*(g) + \deg \sigma^*(h) = \deg \sigma^*(h) \leq \deg h < \deg p,$

a contradiction. That $\deg \sigma^*(h) \leq \deg h$ rather than equality necessarily, is because the homomorphism
$\sigma$ may send some of the coefficients of $h$ (including quite possibly the leading one) to $0 \in F$. □

(3.16) Our final tool requires a little more set-up. We've already observed the similarity between polynomials
and integers. The idea of irreducibility in $\mathbb{Z}$ is just that of a prime number, and perhaps this goes
some way to indicating its importance for polynomials as well. One thing we know about integers is
that they can be written uniquely as a product of primes. We would hope that something similar is true
for polynomials, and it is in certain situations. For the next few results, we deal only with polynomials
$f \in F[x]$ for $F$ a field (they are actually true in more generality, but this is beyond the scope of these
notes). In what follows, it is worth comparing the situation with what you know about $\mathbb{Z}$.

Lemma 4 1. If $\gcd(f, g) = 1$ and $f \mid gh$ then $f \mid h$.

2. If $f$ is irreducible and monic, then for any $g$ monic with $g \mid f$ we have either $g = 1$ or $g = f$.

3. If $g$ is irreducible and monic and $g$ does not divide $f$, then $\gcd(g, f) = 1$.

4. If $g$ is irreducible and monic and $g \mid f_1 f_2 \cdots f_n$ then $g \mid f_i$ for some $i$.

Proof: 1. Since $\gcd(f, g) = 1$ there are $a, b \in F[x]$ such that $1 = af + bg$, hence $h = afh + bgh$.
We have that $f \mid bgh$ by assumption, and it clearly divides $afh$, hence it divides $afh + bgh = h$
also.

2. If $g$ divides $f$ and $f$ is irreducible, then by definition $g$ must be either a constant or a constant
multiple of $f$. But $f$ is monic, so $g = 1$ or $g = f$ are the only possibilities.

3. The gcd of $f$ and $g$ is certainly a divisor of $g$, and hence by irreducibility must be either a constant,
or a constant times $g$. As $g$ is also monic, the gcd must in fact be either 1 or $g$ itself, and since $g$
does not divide $f$ it cannot be $g$, so must be 1.

4. Proceed by induction, with the first step for $n = 1$ being immediate. Since $g \mid f_1 f_2 \cdots f_n = (f_1 f_2 \cdots f_{n-1}) f_n$,
we either have $g \mid f_n$, in which case we are finished, or not, in which case
$\gcd(g, f_n) = 1$ by part (3). But then part (1) gives that $g \mid f_1 f_2 \cdots f_{n-1}$, and the inductive hypothesis
kicks in.

□ 
Perhaps the best way of summarising the lemma is this: monic irreducible polynomials are like the
"prime numbers" of $F[x]$.

(3.17) And just as any integer can be decomposed uniquely as a product of primes, so too can any 
polynomial as a product of irreducible polynomials: 

Unique factorisation in $F[x]$. Every polynomial in $F[x]$ can be written in the form

$\lambda p_1 p_2 \cdots p_r,$

where $\lambda$ is a constant and the $p_i$ are monic and irreducible $\in F[x]$. Moreover, if $\mu q_1 q_2 \cdots q_s$ is another
factorisation with the $q_j$ monic and irreducible, then $r = s$, $\lambda = \mu$ and the $q_j$ are just a rearrangement
of the $p_i$.

The last part says that the factorisation is unique, except for trivial matters like the order you write 
down the factors. Like many such results in mathematics, the first impression is that the existence of the 
factorisation is the useful part, but in fact it is the uniqueness that really is. 

Proof: To get the factorisation in the first place is easy enough: just keep factorising reducible polynomials
until they become irreducible. At the end, pull out the coefficient of the leading term in each factor,
and place them all at the front.
For uniqueness, suppose that

$\lambda p_1 p_2 \cdots p_r = \mu q_1 q_2 \cdots q_s.$

Then $p_r$ divides $\mu q_1 q_2 \cdots q_s$ which by Lemma 4 part (4) means that $p_r \mid q_i$ for some $i$. Reorder the $q$'s so
that it is $p_r \mid q_s$ that in fact we have. Since both $p_r$ and $q_s$ are monic, irreducible, and hence non-constant,
$p_r = q_s$, which leaves us with

$\lambda p_1 p_2 \cdots p_{r-1} = \mu q_1 q_2 \cdots q_{s-1}.$

This gives $r = s$ straight away: if say $s > r$, then repetition of the above leads to $\lambda = \mu q_1 q_2 \cdots q_{s-r}$,
which is absurd, as consideration of degrees gives different answers for each side. Similarly if $r > s$. But
then we also have that the $p$'s are just a rearrangement of the $q$'s, and canceling down to $\lambda p_1 = \mu q_1$, that
$\lambda = \mu$. □

(3.18) It is worth repeating that everything depends on the ambient field $F$, even the uniqueness of the
decomposition. For example, $x^4 - 4$ decomposes as,

$(x^2 + 2)(x^2 - 2)$ in $\mathbb{Q}[x]$,

$(x^2 + 2)(x - \sqrt{2})(x + \sqrt{2})$ in $\mathbb{R}[x]$ and

$(x - \sqrt{2}i)(x + \sqrt{2}i)(x - \sqrt{2})(x + \sqrt{2})$ in $\mathbb{C}[x]$.

To illustrate how unique factorisation can be used to determine irreducibility, we have in $\mathbb{C}[x]$ that,

$x^2 + 2 = (x - \sqrt{2}i)(x + \sqrt{2}i).$

Since the factors on the right are not in $\mathbb{R}[x]$ we have an inkling that this polynomial is irreducible over
$\mathbb{R}$. To make this more precise, any factorisation in $\mathbb{R}[x]$ would be of the form

$x^2 + 2 = (x - \lambda_1)(x - \lambda_2)$

with the $\lambda_i \in \mathbb{R}$. But this would be a factorisation in $\mathbb{C}[x]$ too, and there is only one such by unique
factorisation. This forces the $\lambda_i$ to be $\sqrt{2}i$ and $-\sqrt{2}i$, contradicting $\lambda_i \in \mathbb{R}$. Hence $x^2 + 2$ is indeed
irreducible over $\mathbb{R}$. Similarly, $x^2 - 2$ is irreducible over $\mathbb{Q}$.

Exercise 20 Formulate the example above into a general Theorem. 
Further Exercises for §3.

Exercise 21 Prove that if a polynomial equation has all its coefficients in $\mathbb{C}$ then it must have all its roots in $\mathbb{C}$.

Exercise 22

1. Let $f = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$ be a polynomial in $\mathbb{R}[x]$, that is, all the $a_i \in \mathbb{R}$. Show that complex roots
of $f$ occur in conjugate pairs, ie: $\zeta \in \mathbb{C}$ is a root of $f$ if and only if $\bar{\zeta}$ is.

2. Find an example of a polynomial in $\mathbb{C}[x]$ for which part (1) is not true.

Exercise 23 

1. Let $m$, $n$ and $k$ be integers with $m$ and $n$ relatively prime (ie: $\gcd(m, n) = 1$). Show that if $m$ divides $nk$ then $m$ must
divide $k$ (hint: there are two methods here. One is to use Lemma 4 but in $\mathbb{Z}$. The other is to use the fact that any integer can
be written uniquely as a product of primes. Do this for $m$ and $n$, and ask yourself what it means for this factorisation that
$m$ and $n$ are relatively prime).

2. Show that if $m/n$ is a root of $a_0 + a_1 x + \cdots + a_r x^r$, $a_i \in \mathbb{Z}$, where $m$ and $n$ are relatively prime integers, then $m \mid a_0$
and $n \mid a_r$ (hint: use the first part!).

3. Deduce that if $a_r = 1$ then $m/n$ is in fact an integer.

moral: If a monic polynomial with integer coefficients has a rational root $m/n$, then this rational number is in fact an integer.
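The divisibility conditions in part 2 cut the search for rational roots down to a finite list of candidates, which a computer can test exhaustively. The following Python sketch is our own illustration (the names `divisors` and `rational_roots` are hypothetical, and coefficients are listed from the constant term upwards); it assumes both the constant and leading coefficients are non-zero.

```python
from fractions import Fraction


def divisors(n):
    """Positive divisors of a non-zero integer."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]


def rational_roots(coeffs):
    """Rational roots of a_0 + a_1 x + ... + a_r x^r with integer coefficients.

    Assumes a_0 and a_r are non-zero. Only fractions m/n with m dividing
    a_0 and n dividing a_r are tried.
    """
    a0, ar = coeffs[0], coeffs[-1]
    candidates = {Fraction(sign * m, n)
                  for m in divisors(a0)
                  for n in divisors(ar)
                  for sign in (1, -1)}
    return sorted(c for c in candidates
                  if sum(a * c**k for k, a in enumerate(coeffs)) == 0)


print(rational_roots([1, -3, 2]))    # 2x^2 - 3x + 1 = (2x - 1)(x - 1)
print(rational_roots([-2, 0, 1]))    # x^2 - 2 has no rational roots
```

Exact `Fraction` arithmetic is used so that no rounding can hide or invent a root.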

Exercise 24 If $m \in \mathbb{Z}$ is not a perfect square, show that $x^2 - m$ is irreducible over $\mathbb{Q}$ (note: it is not enough to merely assume
that under the conditions stated $\sqrt{m}$ is not a rational number).

Exercise 25 Find the greatest common divisor of $f(x) = x^3 - 6x^2 + x + 4$ and $g(x) = x^5 - 6x + 1$ (hint: look at linear factors
of $f(x)$).

Exercise 26 Determine which of the following polynomials are irreducible over the stated field: 

1. $1 + x^8$ over $\mathbb{R}$;

2. $1 + x^2 + x^4 + x^6 + x^8 + x^{10}$ over $\mathbb{Q}$ (hint: Let $y = x^2$ and factorise $y^n - 1$);

3. $x^4 + 15x^3 + 7$ over $\mathbb{R}$ (hint: use the intermediate value theorem from analysis);

4. $x^{n+1} + (n + 2)!\,x^n + \cdots + (i + 2)!\,x^i + \cdots + 3!\,x + 2!$ over $\mathbb{Q}$.

5. $x^2 + 1$ over $\mathbb{F}_7$.

6. Let $\mathbb{F}$ be the field of order 8 from §4., and let $\mathbb{F}[X]$ be polynomials with coefficients from $\mathbb{F}$ and indeterminate $X$. Is
$X^3 + (\alpha^2 + \alpha)X + (\alpha^2 + \alpha + 1)$ irreducible over $\mathbb{F}$?

7. $a_4 x^4 + a_3 x^3 + a_2 x^2 + a_1 x + a_0$ over $\mathbb{Q}$ where the $a_i \in \mathbb{Z}$; $a_3, a_2$ are even and $a_4, a_1, a_0$ are odd.

Exercise 27 If $p$ is a prime integer, prove that $p$ is a divisor of $\binom{p}{i}$, for $0 < i < p$.

Exercise 28 Show that 




is irreducible over Q. 

Exercise 29 A complex number $\omega$ is an $n$-th root of unity if $\omega^n = 1$. It is a primitive $n$-th root of unity if $\omega^n = 1$, but $\omega^r \neq 1$
for any $0 < r < n$. So for example, $\pm 1, \pm i$ are the 4-th roots of 1, but only $\pm i$ are primitive 4-th roots.
Convince yourself that for any $n$,

$\omega = \cos\frac{2\pi}{n} + i \sin\frac{2\pi}{n}$

is an $n$-th root of 1. In fact, the other $n$-th roots are $\omega^2, \ldots, \omega^n = 1$.

1. Show that if $\omega$ is a primitive $n$-th root of 1 then $\omega$ is a root of the polynomial

$x^{n-1} + x^{n-2} + \cdots + x + 1.$ (3)

2. Show that for (3) to be irreducible over $\mathbb{Q}$, $n$ cannot be even.

3. Show that a polynomial $f(x)$ is irreducible over a field $F$ if $f(x + 1)$ is irreducible over $F$.

4. Finally, if

$\Phi_p(x) = x^{p-1} + x^{p-2} + \cdots + x + 1$

for $p$ a prime number, show that $\Phi_p(x + 1)$ is irreducible over $\mathbb{Q}$, and hence $\Phi_p(x)$ is too (hint: consider $x^p - 1$ and use
the binomial theorem, Exercise 27 and Eisenstein).

The polynomial $\Phi_p(x)$ is called the $p$-th cyclotomic polynomial.
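Before attempting the last part of Exercise 29 it can be reassuring to see the pattern for a few small primes. The Python sketch below is a numerical check only, not a proof, and its function names are our own. It uses the identity that the shifted polynomial equals $((x + 1)^p - 1)/x$, so that its coefficients are the binomial coefficients $\binom{p}{k}$ for $k = 1, \ldots, p$, and then tests Eisenstein's three conditions at the prime $p$.

```python
from math import comb


def shifted_cyclotomic(p):
    """Coefficients, constant term first, of ((x + 1)^p - 1) / x."""
    return [comb(p, k) for k in range(1, p + 1)]


def satisfies_eisenstein(coeffs, p):
    """Check Eisenstein's conditions at the prime p (coeffs: constant term first)."""
    return (all(c % p == 0 for c in coeffs[:-1])   # p divides every non-leading coefficient
            and coeffs[-1] % p != 0                # p does not divide the leading coefficient
            and coeffs[0] % (p * p) != 0)          # p^2 does not divide the constant term


print(shifted_cyclotomic(5))
print([satisfies_eisenstein(shifted_cyclotomic(p), p) for p in (3, 5, 7, 11)])
```

The exercise asks you to explain why the conditions hold for every prime, which is where Exercise 27 comes in.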
4. Fields I: Basics, extensions and concrete examples



(4.1) This course is primarily the study of solutions to polynomial equations. Broadly speaking, questions 
in this direction can be restated as questions about fields. It is to these that we now turn. 

(4.2) We recalled the definition of a field in Lecture §1. Since then we have become more familiar
with rings, so we can restate the definition as:

Definition. A field is a set $F$ with two operations, $\oplus$ and $\otimes$, such that for any $a, b, c \in F$,

1. $F$ is an Abelian group under $\oplus$ (with $\oplus$ normally called just $+$, $e$ called 0, and $a^{-1}$ called $-a$),

2. $F \setminus \{0\}$ is an Abelian group under $\otimes$ (with $\otimes$ normally called just $\times$, and $f$ called 1),

3. the two operations are linked by the distributive law.

The two groups are called the additive and multiplicative groups of the field. In particular, we will
write $F^*$ to denote the multiplicative group (ie: $F^*$ is the group with elements $F \setminus \{0\}$ and operation the
multiplication from the field). Even more succinctly,

Definition. A field is a set $F$ with two operations, $\oplus$ and $\otimes$, such that for any $a, b, c \in F$,

1. $F$ is a commutative ring under $\oplus$ and $\otimes$ (with $\oplus$ normally called just $+$, $e$ called 0, inverses under
$\oplus$ called $-a$, $\otimes$ just $\times$, and $f$ called 1),

2. for any $a \in F \setminus \{e\}$ there is an $a^{-1} \in F$ with $a \otimes a^{-1} = f = a^{-1} \otimes a$.

In particular, a field is a very special kind of ring.

(4.3) More concepts from the first lecture that can now be properly defined are: 

Definition. Let $F$ and $E$ be fields with $F$ a subfield of $E$. We call $E$ an extension of $F$. The standard
notation for an extension is to write $E/F$, but in these notes we will use the more concrete $F \subseteq E$, being
mindful at all times that this means $F$ is a subfield of $E$, and not just a subset.

If $\beta \in E$, we write, as in §1., $F(\beta)$ for the smallest subfield of $E$ containing both $F$ and $\beta$ (so
in particular $F(\beta)$ is an extension of $F$). In general, if $\beta_1, \ldots, \beta_k \in E$, define
$F(\beta_1, \ldots, \beta_k) = F(\beta_1, \ldots, \beta_{k-1})(\beta_k)$.

We say that $\beta$ is adjoined to $F$ to obtain $F(\beta)$. The last bit of the definition just says that to adjoin
several elements to a field, you just adjoin them one at a time⁴. Finally, if we have an extension $F \subseteq E$
and there is a $\beta \in E$ such that $E = F(\beta)$, then we call $E$ a simple extension of $F$.

(4.4) Trivially, R is an extension of Q; C is an extension of R, and so on. Any field is equally trivially an 
extension of itself! 

(4.5) Let $\mathbb{F}_2$ be the field of integers under arithmetic modulo 2. Let $\alpha$ be an "abstract symbol" that can be
multiplied so that it has the following property: $\alpha \times \alpha \times \alpha = \alpha^3 = \alpha + 1$ (a bit like decreeing that the
imaginary $i$ squares to give $-1$). Let

$\mathbb{F} = \{a + b\alpha + c\alpha^2 \mid a, b, c \in \mathbb{F}_2\}.$

Define addition on $\mathbb{F}$ by: $(a_1 + b_1\alpha + c_1\alpha^2) + (a_2 + b_2\alpha + c_2\alpha^2) = (a_1 + a_2) + (b_1 + b_2)\alpha + (c_1 + c_2)\alpha^2$,
where the addition of coefficients happens in $\mathbb{F}_2$. For multiplication, "expand" the expression
$(a_1 + b_1\alpha + c_1\alpha^2)(a_2 + b_2\alpha + c_2\alpha^2)$ like you would a polynomial with $\alpha$ the indeterminate, so that $\alpha\alpha^2 = \alpha^3$, the
coefficients are dealt with using the arithmetic from $\mathbb{F}_2$, and so on. Replace any $\alpha^3$ that result using the
rule $\alpha^3 = \alpha + 1$.

4 Although the definition has you adjoining them in a particular order, the order doesn't matter. 
For example,

$(1 + \alpha + \alpha^2) + (\alpha + \alpha^2) = 1$ and $(1 + \alpha + \alpha^2)(\alpha + \alpha^2) = \alpha + \alpha^4 = \alpha + \alpha(\alpha + 1) = \alpha^2.$

It turns out that $\mathbb{F}$ forms a field with this addition and multiplication, see Exercise 40. For now we content
ourselves with the following observation: taking those elements of $\mathbb{F}$ with $b = c = 0$, we obtain (an
isomorphic) copy of $\mathbb{F}_2$ inside of $\mathbb{F}$.

Thus, we have an extension of $\mathbb{F}_2$ that contains 8 elements.
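The arithmetic of this 8-element extension is simple enough to code directly. In the Python sketch below (our own illustration, with hypothetical function names `f8_add` and `f8_mul`) an element $a + b\alpha + c\alpha^2$ is stored as the tuple `(a, b, c)` of bits, and products are reduced with the rule $\alpha^3 = \alpha + 1$.

```python
from itertools import product


def f8_add(u, v):
    """Add two elements coefficient by coefficient, modulo 2."""
    return tuple((x + y) % 2 for x, y in zip(u, v))


def f8_mul(u, v):
    """Multiply two elements, then reduce using alpha^3 = alpha + 1."""
    # Expand as polynomials in alpha: powers 0 to 4 can occur.
    prod = [0] * 5
    for i, x in enumerate(u):
        for j, y in enumerate(v):
            prod[i + j] = (prod[i + j] + x * y) % 2
    # Rewrite alpha^k as alpha^(k-3) * (alpha + 1), highest power first.
    for k in (4, 3):
        if prod[k]:
            prod[k] = 0
            prod[k - 3] ^= 1
            prod[k - 2] ^= 1
    return tuple(prod[:3])


u = (1, 1, 1)          # 1 + alpha + alpha^2
v = (0, 1, 1)          # alpha + alpha^2
print(f8_add(u, v))    # the sum from the worked example
print(f8_mul(u, v))    # the product from the worked example

# Every non-zero element should have a multiplicative inverse.
elements = list(product(range(2), repeat=3))
one = (1, 0, 0)
print(all(any(f8_mul(x, y) == one for y in elements)
          for x in elements if x != (0, 0, 0)))
```

The final check is only evidence for the field axioms, not a substitute for verifying them, but it does confirm the one that is hardest to see by eye.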

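(4.6) Certainly, $\mathbb{Q}(\sqrt{2})$ is a simple extension of $\mathbb{Q}$. On the other hand, $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ would appear not to
be; but looking at the definition closely you see that a simple extension is one that can be obtained by
adjoining one element.

Consider now $\mathbb{Q}(\sqrt{2} + \sqrt{3})$: certainly $\sqrt{2} + \sqrt{3} \in \mathbb{Q}(\sqrt{2}, \sqrt{3})$, and so $\mathbb{Q}(\sqrt{2} + \sqrt{3}) \subseteq \mathbb{Q}(\sqrt{2}, \sqrt{3})$.
On the other hand,

$(\sqrt{2} + \sqrt{3})^3 = 11\sqrt{2} + 9\sqrt{3},$

as is readily checked using the Binomial Theorem. Since $(\sqrt{2} + \sqrt{3})^3 \in \mathbb{Q}(\sqrt{2} + \sqrt{3})$, we get

$(11\sqrt{2} + 9\sqrt{3}) - 9(\sqrt{2} + \sqrt{3}) \in \mathbb{Q}(\sqrt{2} + \sqrt{3}) \Rightarrow 2\sqrt{2} \in \mathbb{Q}(\sqrt{2} + \sqrt{3}).$

And so $\sqrt{2} \in \mathbb{Q}(\sqrt{2} + \sqrt{3})$ as $\frac{1}{2}$ is there too. Similarly it can be shown that $\sqrt{3} \in \mathbb{Q}(\sqrt{2} + \sqrt{3})$. The
upshot is that $\mathbb{Q}(\sqrt{2}, \sqrt{3}) \subseteq \mathbb{Q}(\sqrt{2} + \sqrt{3})$. So $\mathbb{Q}(\sqrt{2}, \sqrt{3})$ is a simple extension! It didn't appear to be
as we hadn't written it the right way. We will see more precisely at the end of §9. when extensions are
simple.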
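The cube computed above, and the way the two square roots are recovered from it, can be checked with exact arithmetic. In the Python sketch below (our own illustration; the tuple representation and the names `mul`, `scale` and `sub` are assumptions) a number $a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6}$ with rational coefficients is stored as `(a, b, c, d)`.

```python
from fractions import Fraction


def mul(u, v):
    """Multiply a + b*sqrt2 + c*sqrt3 + d*sqrt6, stored as tuples (a, b, c, d)."""
    a1, b1, c1, d1 = u
    a2, b2, c2, d2 = v
    return (
        a1 * a2 + 2 * b1 * b2 + 3 * c1 * c2 + 6 * d1 * d2,   # rational part
        a1 * b2 + b1 * a2 + 3 * (c1 * d2 + d1 * c2),         # sqrt3 * sqrt6 = 3 * sqrt2
        a1 * c2 + c1 * a2 + 2 * (b1 * d2 + d1 * b2),         # sqrt2 * sqrt6 = 2 * sqrt3
        a1 * d2 + d1 * a2 + b1 * c2 + c1 * b2,               # sqrt2 * sqrt3 = sqrt6
    )


def scale(q, u):
    """Multiply every coefficient by the rational number q."""
    return tuple(q * x for x in u)


def sub(u, v):
    """Subtract coefficient by coefficient."""
    return tuple(x - y for x, y in zip(u, v))


gamma = (0, 1, 1, 0)                                   # sqrt2 + sqrt3
cube = mul(mul(gamma, gamma), gamma)
sqrt2 = scale(Fraction(1, 2), sub(cube, scale(9, gamma)))
sqrt3 = scale(Fraction(1, 2), sub(scale(11, gamma), cube))
print(cube, sqrt2, sqrt3)
```

Both square roots are obtained from the single element `gamma` using only field operations and rational numbers, which is exactly the containment argued in the text.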



(4.7) What do the elements of the field $\mathbb{Q}(\sqrt{2})$ actually look like? Later we will be able to answer this question
in a general and completely satisfactory manner, but for now we can feel our way towards an ad-hoc
answer.

Certainly $\sqrt{2}$ and any $b \in \mathbb{Q}$ are in $\mathbb{Q}(\sqrt{2})$ by definition. Since fields are closed under $\times$, any number
of the form $b\sqrt{2} \in \mathbb{Q}(\sqrt{2})$. Similarly, fields are closed under $+$, so any $a + b\sqrt{2} \in \mathbb{Q}(\sqrt{2})$ for $a \in \mathbb{Q}$.
Thus, the set

$\mathbb{F} = \{a + b\sqrt{2} \mid a, b \in \mathbb{Q}\} \subseteq \mathbb{Q}(\sqrt{2}).$

But $\mathbb{F}$ is a field in its own right using the usual addition and multiplication of complex numbers. This is
easily checked from the axioms; for instance, the inverse of a non-zero $a + b\sqrt{2}$ can be calculated:

$\frac{1}{a + b\sqrt{2}} \times \frac{a - b\sqrt{2}}{a - b\sqrt{2}} = \frac{a - b\sqrt{2}}{a^2 - 2b^2} = \frac{a}{a^2 - 2b^2} - \frac{b}{a^2 - 2b^2}\sqrt{2} \in \mathbb{F},$

and you can check the other axioms for yourself. We also have $\mathbb{Q} \subseteq \mathbb{F}$ (letting $b = 0$) and $\sqrt{2} \in \mathbb{F}$ (letting
$a = 0$, $b = 1$). Since $\mathbb{Q}(\sqrt{2})$ is the smallest field having these two properties, we have $\mathbb{Q}(\sqrt{2}) \subseteq \mathbb{F}$. Thus,

$\mathbb{Q}(\sqrt{2}) = \mathbb{F} = \{a + b\sqrt{2} \mid a, b \in \mathbb{Q}\}.$
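The inverse formula is easy to experiment with using exact rational arithmetic. Here is a minimal Python sketch (our own, with hypothetical function names `inverse` and `multiply`), in which the number $a + b\sqrt{2}$ is stored as the pair `(a, b)` of rationals.

```python
from fractions import Fraction


def multiply(x, y):
    """(a + b*sqrt2)(c + d*sqrt2) = (ac + 2bd) + (ad + bc)*sqrt2."""
    (a, b), (c, d) = x, y
    return (a * c + 2 * b * d, a * d + b * c)


def inverse(a, b):
    """Inverse of a + b*sqrt2, obtained by multiplying top and bottom by a - b*sqrt2."""
    a, b = Fraction(a), Fraction(b)
    norm = a * a - 2 * b * b
    if norm == 0:
        # For rational a, b this only happens when a = b = 0,
        # because sqrt2 is irrational.
        raise ZeroDivisionError("0 has no multiplicative inverse")
    return (a / norm, -b / norm)


print(inverse(1, 1))                       # 1 / (1 + sqrt2)
print(multiply((1, 1), inverse(1, 1)))     # should be the pair representing 1
print(inverse(3, Fraction(1, 2)))
```

The first line of output says that $1/(1 + \sqrt{2}) = -1 + \sqrt{2}$, which can be confirmed by multiplying out.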

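Exercise 30 Let $\alpha$ be a complex number such that $\alpha^3 = 2$ and consider the set

$\mathbb{F} = \{a_0 + a_1\alpha + a_2\alpha^2 \mid a_i \in \mathbb{Q}\}$

1. By row reducing the matrix,

$\left(\begin{array}{ccc|c} a_0 & 2a_2 & 2a_1 & 1 \\ a_1 & a_0 & 2a_2 & 0 \\ a_2 & a_1 & a_0 & 0 \end{array}\right)$

find an element of $\mathbb{F}$ that is the inverse under multiplication of $a_0 + a_1\alpha + a_2\alpha^2$.

2. Show that $\mathbb{F}$ is a field, hence $\mathbb{Q}(\alpha) = \mathbb{F}$.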
(4.8) The previous exercise shows that the following two fields have the form,

$\mathbb{Q}(\sqrt[3]{2}) = \{a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2 \mid a, b, c \in \mathbb{Q}\}$ and $\mathbb{Q}(\beta) = \{a + b\beta + c\beta^2 \mid a, b, c \in \mathbb{Q}\}$,

where

$\beta = \frac{-\sqrt[3]{2} + \sqrt[3]{2}\sqrt{3}\,i}{2}.$

Observe for now that these two fields are different. The first is clearly completely contained in $\mathbb{R}$, but the
second contains $\beta$, which is obviously complex but not real.

(4.9) A bijective homomorphism of rings $\varphi : R \to S$ is called an isomorphism.
A silly but instructive example is given by the Roman ring, whose elements are

{..., -V, -IV, -III, -II, -I, 0, I, II, III, IV, V, ...},

and with addition and multiplication giving such things as IX + IV = XIII and IX × VI = LIV, ...
Obviously the ring is isomorphic to $\mathbb{Z}$, and it is this idea of a trivial relabelling that is captured by the
idea of an isomorphism: two rings are isomorphic if they are really the same, just written in different
languages! The translation is carried out by the mapping $\varphi$.

It seems a sensible enough idea, but we place a huge emphasis on the way things are labelled, often 
without even realising that we are doing it. The two fields above are a good example, for,

$\mathbb{Q}(\sqrt[3]{2})$ and $\mathbb{Q}\left(\frac{-\sqrt[3]{2} + \sqrt[3]{2}\sqrt{3}\,i}{2}\right)$ are isomorphic!

(we'll see why in §6.). To illustrate how we might now come unstuck, suppose we were to formulate the
following,

"Definition". A subfield of $\mathbb{C}$ is called real if and only if it is contained in $\mathbb{R}$.

So $\mathbb{Q}(\sqrt[3]{2})$ is a real field, but $\mathbb{Q}\left(\frac{-\sqrt[3]{2} + \sqrt[3]{2}\sqrt{3}\,i}{2}\right)$ is not. But they are the same field! A definition
should not depend on the way the elements are labelled. The problem is that we have become too bogged
down in the minutiae of real and complex numbers and we need to think about fields in a more abstract
way.

(4.10) The previous example has motivated the direction of the next few sections. In the remainder of 
this section we introduce a few more concepts associated with fields. 

It is well known that $\sqrt{2}$ and $\pi$ are both irrational real numbers. Nevertheless, from an algebraic point of view, $\sqrt{2}$ is slightly more tractable than $\pi$, as it is a root of a very simple equation $x^2 - 2 = 0$, whereas there is no non-zero polynomial with integer coefficients having $\pi$ as a root (this is not obvious).

Let $F \subseteq E$ be an extension of fields and $\alpha \in E$. Call $\alpha$ algebraic over $F$ if and only if

$$a_0 + a_1\alpha + a_2\alpha^2 + \cdots + a_n\alpha^n = 0,$$

for some $a_0, a_1, \ldots, a_n \in F$, not all zero. In other words, $\alpha$ is a root of the non-zero polynomial $f = a_0 + a_1x + a_2x^2 + \cdots + a_nx^n$ in $F[x]$. If $\alpha$ is not algebraic, ie: not the root of any non-zero polynomial with $F$-coefficients, then we say that it is transcendental over $F$.
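To see the definition in action, here is a small sketch that checks, in exact arithmetic, that $1 + \sqrt{2}$ is a root of $x^2 - 2x - 1$ and hence algebraic over $\mathbb{Q}$. The choice of element and polynomial, and the helper names `mul` and `evaluate`, are assumptions made for the illustration; elements $a + b\sqrt{2}$ are stored as pairs $(a, b)$.

```python
from fractions import Fraction

def mul(u, v):
    """(a + b*sqrt2)(c + d*sqrt2) as exact pairs (rational part, sqrt2 part)."""
    (a, b), (c, d) = u, v
    return (a * c + 2 * b * d, a * d + b * c)

def evaluate(coeffs, alpha):
    """Horner evaluation of a_0 + a_1 x + ... + a_n x^n at alpha in Q(sqrt 2).

    coeffs lists the rational coefficients a_0, ..., a_n (lowest degree first)."""
    result = (Fraction(0), Fraction(0))
    for a in reversed(coeffs):
        result = mul(result, alpha)
        result = (result[0] + a, result[1])
    return result

alpha = (Fraction(1), Fraction(1))       # 1 + sqrt(2)
value = evaluate([-1, -2, 1], alpha)     # f = -1 - 2x + x^2
```

A value of `(0, 0)` means that both the rational part and the $\sqrt{2}$ part of $f(\alpha)$ vanish. No such finite check is available for transcendence, which is one reason transcendental numbers are harder to get hold of.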

As the story of Galois theory develops, we will see that it is the algebraic elements over $F$ that are the most easily understood. It is tempting to think of them as having expressions in terms of elements of $F$, the four field operations $+, -, \times, \div$ and roots $\sqrt{\ }, \sqrt[3]{\ }, \ldots, \sqrt[n]{\ }, \ldots$, but as we shall see in §16., the situation is much more subtle than that. Indeed there are algebraic numbers that cannot be expressed algebraically. For now it is best just to stick to the definition and not read too much into it.

(4.11) Some simple examples:



V2. 



^ and ^2 + 5^3, 



are algebraic over $\mathbb{Q}$, whereas $\pi$ and $e$ are transcendental over $\mathbb{Q}$; $\pi$ is however algebraic over $\mathbb{Q}(\pi)$.

(4.12) A field can obviously contain many subfields: if we look at $\mathbb{C}$, it contains $\mathbb{Q}(\sqrt{2})$, $\mathbb{R}$, ... It also contains $\mathbb{Q}$, but no subfields that are smaller than this, in the usual sense that they are properly contained in $\mathbb{Q}$. Indeed, any subfield of $\mathbb{C}$ contains $\mathbb{Q}$. So, $\mathbb{Q}$ is the "smallest" subfield of the complex numbers.

For any field $F$, the prime subfield $\mathbb{F}$ of $F$ is the smallest subfield of $F$, in the sense that if $F'$ is any subfield with $F' \subseteq \mathbb{F}$ then $F' = \mathbb{F}$.

Exercise 31 Show that the prime subfield can also be defined as the intersection of all the subfields of $F$. Thus in particular, the prime subfield is contained in every subfield of $F$.

Exercise 32 Consider the field of rational numbers $\mathbb{Q}$ or the finite field $\mathbb{F}_p$ having $p$ elements. Show that neither of these fields contains a proper subfield (hint: for $\mathbb{F}_p$, consider the additive group and use Lagrange's Theorem from §11. For $\mathbb{Q}$, any subfield must contain 1; show that it must then be all of $\mathbb{Q}$).

Whatever the prime subfield is, it must contain 1, hence any expression of the form $1 + 1 + \cdots + 1$ for any number of summands. If no such expression equals 0 in the field, then we have infinitely many distinct such elements, and their inverses under addition, so what we have is basically a copy of $\mathbb{Z}$ in $F$. Otherwise, if $n$ is the smallest number of summands for which such an expression is equal to 0, then the elements

$$1,\ 1 + 1,\ 1 + 1 + 1,\ \ldots,\ \underbrace{1 + 1 + \cdots + 1}_{n \text{ times}} = 0,$$

form a copy of $\mathbb{Z}_n$ inside of $F$.

These comments can be made precise as in the following exercise. It looks ahead a little, requiring the first isomorphism theorem for rings in §5.

Exercise 33 Let $F$ be a field and define a map $\mathbb{Z} \to F$ by

$$n \mapsto \begin{cases} 0, & \text{if } n = 0, \\ 1 + \cdots + 1 \ (n \text{ times}), & \text{if } n > 0, \\ -1 - \cdots - 1 \ (|n| \text{ times}), & \text{if } n < 0. \end{cases}$$

Show that the map is a homomorphism. If the kernel consists of just $\{0\}$, then show that $F$ contains $\mathbb{Z}$ as a subring. Otherwise, let $n$ be the smallest positive integer contained in the kernel, and show that $F$ contains $\mathbb{Z}_n$ as a subring. As $F$ is a field, hence an integral domain, show that we must have $n = p$ a prime in this situation.

Thus any field contains a subring isomorphic to $\mathbb{Z}$ or to $\mathbb{Z}_p$ for some prime $p$. But the ring $\mathbb{Z}_p$ is the field $\mathbb{F}_p$, and we saw in Exercise 32 that $\mathbb{F}_p$ contains no proper subfields. The conclusion is that in the second case above, the prime subfield is this copy of $\mathbb{F}_p$. In the first case, $\mathbb{Z}$ is obviously not a field, but each non-zero $m$ in this copy of $\mathbb{Z}$ has an inverse $1/m$ in $F$, and the product of this with any other $n$ gives an element $n/m \in F$. The set of all such elements obtained is a copy of $\mathbb{Q}$ inside $F$.

Exercise 34 Make these loose statements precise: let $F$ be a field and $R$ a subring of $F$ with $\varphi : \mathbb{Z} \to R$ an isomorphism of rings (this is what we mean when we say that $F$ contains a copy of $\mathbb{Z}$). Show that this can be extended to an isomorphism $\hat{\varphi} : \mathbb{Q} \to F' \subseteq F$ with $\hat{\varphi}|_{\mathbb{Z}} = \varphi$.

(4.13) Putting it all together we get: the prime subfield of a field is isomorphic either to the rationals $\mathbb{Q}$ or to the finite field $\mathbb{F}_p$ for some prime $p$. Define the characteristic of a field to be 0 if the prime subfield is $\mathbb{Q}$ or $p$ if the prime subfield is $\mathbb{F}_p$. Thus fields like $\mathbb{Q}$, $\mathbb{R}$ and $\mathbb{C}$ have characteristic zero, and indeed, any field of characteristic zero must be infinite, as it contains $\mathbb{Q}$. Fields like $\mathbb{F}_2, \mathbb{F}_3, \ldots$ and the field $F$ of order 8 given above have characteristic 2, 3 and 2 respectively.

Exercise 35 Show that a field $F$ has characteristic $p > 0$ if and only if $p$ is the smallest number of summands such that the expression $1 + 1 + \cdots + 1$ is equal to 0. Show that $F$ has characteristic 0 if and only if no such expression is equal to 0.

Thus, all fields of characteristic 0 are infinite, and the only examples we know of fields of characteristic $p > 0$ are finite. It is not true though that a field of characteristic $p > 0$ must be finite. There are some examples of infinite fields of characteristic $p > 0$ below.
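The counting of summands in Exercises 33 and 35 is easy to experiment with in the rings $\mathbb{Z}_n$. The sketch below (function names are made up for the illustration) counts how many 1's must be added to reach 0, and tests by brute force whether $\mathbb{Z}_n$ is an integral domain, which is what forces $n$ to be prime inside a field.

```python
def additive_order_of_one(n):
    """Number of summands needed for 1 + 1 + ... + 1 to vanish in Z_n."""
    total, count = 1 % n, 1
    while total != 0:
        total = (total + 1) % n
        count += 1
    return count

def is_integral_domain(n):
    """True when no two non-zero elements of Z_n multiply to zero (brute force)."""
    return all((a * b) % n != 0 for a in range(1, n) for b in range(1, n))

# Values of n between 2 and 12 for which Z_n has no zero divisors
domains = [n for n in range(2, 13) if is_integral_domain(n)]
```

For composite $n = ab$ the elements $a$ and $b$ are zero divisors, so only prime $n$ should survive in `domains`.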

Exercise 36 Suppose that $f$ is an irreducible polynomial over a field $F$ of characteristic 0. Recalling Exercise 16, show that the roots of $f$ in any extension $E$ of $F$ are distinct.

(4.14) A natural question is to ask what fields contain the integers $\mathbb{Z}$?$^5$ Obviously the rationals $\mathbb{Q}$ do, and indeed by Exercise 34, as soon as a field contains a copy of $\mathbb{Z}$ it must also contain a copy of $\mathbb{Q}$.

It turns out that we can also construct $\mathbb{Q}$ abstractly from $\mathbb{Z}$ without having to first position it inside another field: consider the set

$$F = \{(a, b) \mid a, b \in \mathbb{Z},\ b \neq 0, \text{ and } (a, b) = (c, d) \text{ iff } ad = bc\}.$$

In other words, we take all ordered pairs of elements from $\mathbb{Z}$ whose second entry is non-zero, but think of two ordered pairs $(a, b)$ and $(c, d)$ as being the same if $ad = bc$, eg: think of $(0, 1)$ and $(0, 2)$ as being the same element of $F$, and similarly $(1, 1)$ and $(3, 3)$.

Aside. One makes these loose statements more precise by defining an equivalence relation on the set of ordered pairs $(a, b) \in \mathbb{Z} \times \mathbb{Z}$ with $b \neq 0$ as $(a, b) \sim (c, d)$ if and only if $ad = bc$. The elements of $F$ are then the equivalence classes under this relation. We will nevertheless stick with the looser formulation.

Define addition and multiplication on $F$ in the following way:

$$(a, b) + (c, d) = (ad + bc, bd) \quad\text{and}\quad (a, b)(c, d) = (ac, bd).$$

Exercise 37

1. Show that these definitions are "well-defined", ie: if $(a, b) = (a', b')$ and $(c, d) = (c', d')$, then $(a, b) + (c, d) = (a', b') + (c', d')$ and $(a, b)(c, d) = (a', b')(c', d')$. In other words, if two pairs are thought of as being the same, it doesn't matter which one we use in the arithmetic as we get the same answer.

2. Show that $F$ is a field.

3. Now define a map $\varphi : F \to \mathbb{Q}$ by $\varphi(a, b) = a/b$. Show that the map is well defined, ie: if $(a, b) = (a', b')$ then $\varphi(a, b) = \varphi(a', b')$. Show that $\varphi$ is an isomorphism from $F$ to $\mathbb{Q}$.

This construction can be generalised as the following Exercise shows:

Exercise 38 Repeat the construction above with $\mathbb{Z}$ replaced by an arbitrary integral domain $R$. The resulting field is called the field of fractions of $R$.
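The pair construction is concrete enough to run. The following sketch (helper names `same`, `add`, `mul` are assumptions for the illustration) implements the rules for pairs of integers and spot-checks, on one example, the well-definedness asked for in Exercise 37. A single example is of course not a proof.

```python
def same(p, q):
    """(a, b) and (c, d) name the same element iff ad = bc."""
    (a, b), (c, d) = p, q
    return a * d == b * c

def add(p, q):
    """(a, b) + (c, d) = (ad + bc, bd)."""
    (a, b), (c, d) = p, q
    return (a * d + b * c, b * d)

def mul(p, q):
    """(a, b)(c, d) = (ac, bd)."""
    (a, b), (c, d) = p, q
    return (a * c, b * d)

# Two names for "1/2" and two names for "2/3"
p, p_alt = (1, 2), (3, 6)
q, q_alt = (2, 3), (-4, -6)

sum_agrees = same(add(p, q), add(p_alt, q_alt))
product_agrees = same(mul(p, q), mul(p_alt, q_alt))
```

Nothing in the three functions uses anything about $\mathbb{Z}$ beyond its ring operations, which is the observation behind Exercise 38.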

The field of fractions construction provides some very interesting examples of fields, possibly new in the reader's experience. Let $F[x]$ be the ring of polynomials with $F$-coefficients where $F$ is any field. The field of fractions of this integral domain has elements of the form $f(x)/g(x)$ for $f$ and $g$ polynomials with $g \neq 0$, in other words, rational functions with $F$-coefficients. The field is denoted $F(x)$ and is called the field of rational functions over $F$.

An infinite field of characteristic $p$: If $\mathbb{F}_p$ is the finite field of order $p$, then the field of rational functions $\mathbb{F}_p(x)$ is obviously infinite (it contains for example all the polynomials over $\mathbb{F}_p$, of which there are an infinite number). Moreover, the rational function 1 adds to itself $p$ times to give 0.

A field properly containing the complex numbers: any field $F$ is properly contained in $F(x)$, even $F = \mathbb{C}$.



Further Exercises for §4. 

Exercise 39 Let $F$ be the set of all matrices of the form $\begin{pmatrix} a & b \\ 2b & a \end{pmatrix}$ where $a, b$ are in the field $\mathbb{F}_5$. Define addition and multiplication to be the usual addition and multiplication of matrices (and also the addition and multiplication in $\mathbb{F}_5$). Show that $F$ is a field. How many elements does it have?

$^5$ or more precisely, which fields contain an isomorphic copy of the integers.

Exercise 40 Let $\mathbb{F}_2$ be the field of integers modulo 2, and $\alpha$ be an "abstract symbol" that can be multiplied so that it has the following property: $\alpha \times \alpha \times \alpha = \alpha^3 = \alpha + 1$ (a bit like decreeing that the imaginary $i$ squares to give $-1$). Let

$$F = \{a + b\alpha + c\alpha^2 \mid a, b, c \in \mathbb{F}_2\}.$$

Define addition on $F$ by: $(a_1 + b_1\alpha + c_1\alpha^2) + (a_2 + b_2\alpha + c_2\alpha^2) = (a_1 + a_2) + (b_1 + b_2)\alpha + (c_1 + c_2)\alpha^2$, where the addition of coefficients happens in $\mathbb{F}_2$. For multiplication, "expand" the expression $(a_1 + b_1\alpha + c_1\alpha^2)(a_2 + b_2\alpha + c_2\alpha^2)$ like you would a polynomial with $\alpha$ the indeterminate, the coefficients are dealt with using the arithmetic from $\mathbb{F}_2$, and so on. Replace any $\alpha^3$ that result using the rule above.

1. Write down all the elements of $F$.

2. Write out the addition and multiplication tables for $F$ (ie: the tables with rows and columns indexed by the elements of $F$, with the entry in the $i$-th row and $j$-th column the sum/product of the $i$-th and $j$-th elements of the field). Hence show that $F$ is a field (you can assume that the addition and multiplication are associative as well as the distributive law, as these are a bit tedious to verify!) Using your tables, find the inverses (under multiplication) of the elements $1 + \alpha$ and $1 + \alpha + \alpha^2$, ie: find

$$\frac{1}{1 + \alpha} \quad\text{and}\quad \frac{1}{1 + \alpha + \alpha^2} \quad\text{in } F.$$

3. Is the extension $\mathbb{F}_2 \subseteq F$ a simple one?

Exercise 41 Take the set $F$ of the previous exercise, and define addition/multiplication in the same way except that the rule for simplification is now $\alpha^3 = \alpha^2 + \alpha + 1$. Show that in this case you don't get a field.
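The tables of Exercises 40 and 41 can be generated by machine as a check on hand computation. The sketch below is one possible encoding, not the only one: an element $a + b\alpha + c\alpha^2$ is stored as a 3-bit integer (bit $k$ is the coefficient of $\alpha^k$), addition in $\mathbb{F}_2$ is XOR, and `make_mul` is a made-up helper that builds the multiplication for a given replacement rule for $\alpha^3$.

```python
def make_mul(rule):
    """Multiplication on {a + b*t + c*t^2} over F_2, elements stored as 3-bit integers.

    `rule` is the bit pattern of the expression that t^3 is replaced by."""
    def mul(u, v):
        prod = 0
        for k in range(3):                 # carry-less (XOR) polynomial product
            if (v >> k) & 1:
                prod ^= u << k
        for k in (4, 3):                   # replace t^4 first, then t^3
            if (prod >> k) & 1:
                prod ^= (1 << k) ^ (rule << (k - 3))
        return prod
    return mul

mul_f8 = make_mul(0b011)     # t^3 = t + 1        (rule of Exercise 40)
mul_bad = make_mul(0b111)    # t^3 = t^2 + t + 1  (rule of Exercise 41)

def has_inverse(mul, u):
    return any(mul(u, v) == 1 for v in range(1, 8))

invertible_f8 = [has_inverse(mul_f8, u) for u in range(1, 8)]
zero_divisors = [(u, v) for u in range(1, 8) for v in range(1, 8)
                 if mul_bad(u, v) == 0]
```

With the first rule every non-zero element should turn out to be invertible, while with the second rule `zero_divisors` should be non-empty, which is exactly the obstruction to being a field.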

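Exercise 42 Verify the claim in lectures that the set $F = \{a + b\sqrt{2} \mid a, b \in \mathbb{Q}\}$ is a subfield of $\mathbb{C}$.

Exercise 43 Verify the claim in lectures that $\mathbb{Q}(\sqrt[3]{2}) = \{a + b(\sqrt[3]{2}) + c(\sqrt[3]{2})^2 \mid a, b, c \in \mathbb{Q}\}$.

Exercise 44 Find a complex number $\alpha$ such that $\mathbb{Q}(\sqrt{2}, i) = \mathbb{Q}(\alpha)$.

Exercise 45 Is $\mathbb{Q}(\sqrt{2}, \sqrt{3}, \sqrt{7})$ a simple extension of $\mathbb{Q}(\sqrt{2}, \sqrt{3})$, $\mathbb{Q}(\sqrt{2})$ or even of $\mathbb{Q}$?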

Exercise 46 Let $\nabla$ be an "abstract symbol" that has the following property: $\nabla^2 = -\nabla - 1$ (a bit like $i$ squaring to give $-1$). Let

$$F = \{a + b\nabla \mid a, b \in \mathbb{R}\},$$

and define an addition on $F$ by: $(a_1 + b_1\nabla) + (a_2 + b_2\nabla) = (a_1 + a_2) + (b_1 + b_2)\nabla$. For multiplication, expand the expression $(a_1 + b_1\nabla)(a_2 + b_2\nabla)$ normally (treating $\nabla$ like an indeterminate, so that $\nabla\nabla = \nabla^2$, and so on), and replace the resulting $\nabla^2$ using the rule above. Show that $F$ is a field, and is just the complex numbers $\mathbb{C}$. Do exactly the same thing, but with symbol $\Delta$ satisfying $\Delta^2 = \sqrt{2}\,\Delta - \sqrt[3]{5}$. Show that you still get the complex numbers.

§5. Rings II: Quotients

(5.1) Let $R$ be a commutative ring. A subset $\mathfrak{a} \subseteq R$ is called a principal ideal iff there is some $r \in R$ such that

$$\mathfrak{a} = \{pr \mid p \in R\}.$$

In other words, $\mathfrak{a}$ is precisely the set of all multiples of some fixed element $r$. Denote such an ideal by $\langle r \rangle$.

The name is "principal" ideal as there are more general kinds of ideal. Nevertheless, in all rings we will be concerned with, every ideal in this more general sense turns out to be a principal ideal, so we will drop the principal from now on and just say "ideal".

Aside. Here is the more general notion: $\mathfrak{a} \subseteq R$ is an ideal iff $\mathfrak{a}$ is a subgroup of the abelian group $(R, +)$ and for any $s \in R$ we have $s\mathfrak{a} = \{sp \mid p \in \mathfrak{a}\} \subseteq \mathfrak{a}$ and similarly $\mathfrak{a}s \subseteq \mathfrak{a}$. For example, if $R = \mathbb{Z}[x]$, then the set $\mathfrak{a}$ of polynomials with even constant term forms an ideal. But there is no single polynomial $f$ in $\mathbb{Z}[x]$ such that $\mathfrak{a}$ is exactly the set of multiples of $f$.

Notice that if $s \in R$ and $\langle r \rangle$ is an ideal, then $s\langle r \rangle = \{s(pr) \mid p \in R\} = \{(sp)r \mid p \in R\}$ which is $\subseteq \langle r \rangle$. Similarly, $\langle r \rangle s \subseteq \langle r \rangle$ (as $R$ is commutative). In other words, multiplying the elements of an ideal by an arbitrary element of the ring gives elements of the ideal.

(5.2) In any ring there are the trivial ideals $\langle 0 \rangle = \{0\}$ and $\langle 1 \rangle = R$, the second one as any element of $R$ is a multiple of 1.

Exercise 47

1. Show that the only ideals in a field $F$ are the two trivial ones (hint: use the property of ideals mentioned at the end of the last paragraph).

2. If $R$ is a commutative ring whose only ideals are $\{0\}$ and $R$, then show that $R$ is a field.

3. Show that in the non-commutative ring $M_n(F)$ of $n \times n$ matrices with entries from the field $F$ there are only the two trivial ideals, but that $M_n(F)$ is not a field.

(5.3) For another example, consider the ring $\mathbb{Q}[x]$, the number $\sqrt{2} \in \mathbb{R}$, and the evaluation homomorphism $\varepsilon_{\sqrt{2}} : \mathbb{Q}[x] \to \mathbb{R}$ given by

$$\varepsilon_{\sqrt{2}}(a_nx^n + \cdots + a_0) = a_n(\sqrt{2})^n + \cdots + a_0$$

(see §2.). Let $\mathfrak{a}$ be the set of all polynomials in $\mathbb{Q}[x]$ that are sent to $0 \in \mathbb{R}$ by this map. Certainly $x^2 - 2 \in \mathfrak{a}$ (as $\sqrt{2}^2 - 2 = 0$). If $f = (x^2 - 2)g \in \mathbb{Q}[x]$, then

$$\varepsilon_{\sqrt{2}}(f) = \varepsilon_{\sqrt{2}}(x^2 - 2)\,\varepsilon_{\sqrt{2}}(g) = 0 \times \varepsilon_{\sqrt{2}}(g) = 0,$$

using the fact that $\varepsilon_{\sqrt{2}}$ is a homomorphism. Thus, $f \in \mathfrak{a}$, and so the ideal $\langle x^2 - 2 \rangle$ is $\subseteq \mathfrak{a}$.

Conversely, if $h$ is sent to 0 by $\varepsilon_{\sqrt{2}}$, ie: $h \in \mathfrak{a}$, we can divide it by $x^2 - 2$ using the division algorithm,

$$h = (x^2 - 2)q + r,$$

where $\deg r < 2$, so that $r = ax + b$ for some $a, b \in \mathbb{Q}$. But since $\varepsilon_{\sqrt{2}}(h) = 0$ we have

$$(\sqrt{2}^2 - 2)q(\sqrt{2}) + r(\sqrt{2}) = 0 \Rightarrow r(\sqrt{2}) = 0 \Rightarrow a\sqrt{2} + b = 0.$$

If $a \neq 0$, then $\sqrt{2} = -b/a \in \mathbb{Q}$ as $a, b \in \mathbb{Q}$, which is plainly nonsense. Thus $a = 0$, hence $b = 0$ too, so that $r = 0$, and hence $h = (x^2 - 2)q \in \langle x^2 - 2 \rangle$, and we get that $\mathfrak{a} \subseteq \langle x^2 - 2 \rangle$.

The conclusion is that the set of polynomials in $\mathbb{Q}[x]$ sent to zero by the evaluation homomorphism $\varepsilon_{\sqrt{2}}$ is an ideal, namely $\langle x^2 - 2 \rangle$.

(5.4) This in fact always happens. Remember that if $R, S$ are rings and $\varphi : R \to S$ a ring homomorphism, then the kernel of $\varphi$, denoted $\ker\varphi$, is the set of all elements of $R$ sent to $0 \in S$ by $\varphi$, ie:

$$\ker\varphi = \{r \in R \mid \varphi(r) = 0 \in S\}.$$

Proposition 2 If $F$ is a field and $S$ a ring then the kernel of a homomorphism $\varphi : F[x] \to S$ is an ideal.

Proof: If $\ker\varphi = \{0\}$ then it is the ideal $\langle 0 \rangle$, so suppose otherwise. Choose $g \in \ker\varphi$ non-zero of smallest degree (which we can do since the degrees of polynomials come from the set $\mathbb{Z}^+ \cup \{-\infty\}$). We claim that $\ker\varphi = \langle g \rangle$, for which we need to show that these two sets are mutually contained within each other. On the one hand, if $pg \in \langle g \rangle$ then

$$\varphi(pg) = \varphi(p)\varphi(g) = \varphi(p) \times 0 = 0,$$

since $g \in \ker\varphi$. Thus, $\langle g \rangle \subseteq \ker\varphi$.

On the other hand, let $f \in \ker\varphi$ and use the division algorithm to divide it by $g$,

$$f = qg + r,$$

where $\deg r < \deg g$. Now, $r = f - qg \Rightarrow \varphi(r) = \varphi(f - qg) = \varphi(f) - \varphi(q)\varphi(g) = 0 - \varphi(q) \cdot 0 = 0$, since both $f, g \in \ker\varphi$. Thus, $r$ is also in the kernel of $\varphi$. If $r$ was a non-zero polynomial, then we would have a contradiction because $\deg r < \deg g$, but $g$ was chosen from $\ker\varphi$ to have smallest degree. Thus we must have that $r = 0$, hence $f = qg \in \langle g \rangle$, ie: $\ker\varphi \subseteq \langle g \rangle$. □

(5.5) Let $\langle f \rangle \subseteq F[x]$ be an ideal and $g \in F[x]$ any polynomial. The set

$$g + \langle f \rangle = \{g + h \mid h \in \langle f \rangle\},$$

is called the coset of $\langle f \rangle$ with representative $g$ (or the coset of $\langle f \rangle$ determined by $g$).

(5.6) As an example, consider the ideal $\mathfrak{a} = \langle x \rangle$ in $\mathbb{F}_2[x]$. Thus, $\mathfrak{a}$ is the set of all multiples of $x$, which is just the same thing as the collection of polynomials in $\mathbb{F}_2[x]$ that have no constant term. What are the cosets of $\mathfrak{a}$? Let $g$ be any polynomial and consider the coset $g + \langle x \rangle$. The only possibilities are that $g$ has no constant term, or it does, in which case this term is 1 (we are in $\mathbb{F}_2[x]$).

If $g$ has no constant term, then

$$g + \langle x \rangle = \langle x \rangle.$$

For, $g$ plus a polynomial with no constant term is another polynomial with no constant term, ie: $g + \langle x \rangle \subseteq \langle x \rangle$. On the other hand, if $p \in \langle x \rangle$ is any polynomial with no constant term, then $p - g \in \langle x \rangle$ so $p = g + (p - g) \in g + \langle x \rangle$, ie: $\langle x \rangle \subseteq g + \langle x \rangle$.

If $g$ does have a constant term, you can convince yourself in exactly the same way that,

$$g + \langle x \rangle = 1 + \langle x \rangle.$$

Thus, there are only two cosets of $\langle x \rangle$ in $\mathbb{F}_2[x]$, namely the ideal $\mathfrak{a} = \langle x \rangle$ itself and $1 + \mathfrak{a}$; in English, the first coset consists of those polynomials without constant term, and the second those with a constant term.

Notice that these two cosets are completely disjoint, but every polynomial is in one or the other of them.

(5.7) Here are some basic properties of cosets:

1. Every polynomial $g$ is in some coset of $\langle f \rangle$, for $g = g + 0 \times f \in g + \langle f \rangle$.

2. For any $q$, we have $qf + \langle f \rangle = \langle f \rangle$, so multiples of $f$ get "absorbed" into the ideal $\langle f \rangle$.

3. The following three things are equivalent: (i). $g_1$ and $g_2$ lie in the same coset of $\langle f \rangle$; (ii). $g_1 + \langle f \rangle = g_2 + \langle f \rangle$; (iii). $g_1$ and $g_2$ differ by a multiple of $f$. To see this: (iii) $\Rightarrow$ (ii) If $g_1 - g_2 = pf$ then $g_1 = g_2 + pf$ so that $g_1 + \langle f \rangle = g_2 + pf + \langle f \rangle = g_2 + \langle f \rangle$; (ii) $\Rightarrow$ (i) Since $g_1 \in g_1 + \langle f \rangle$ and $g_2 \in g_2 + \langle f \rangle$, and these cosets are equal, we have that $g_1, g_2$ lie in the same coset; (i) $\Rightarrow$ (iii) If $g_1$ and $g_2$ lie in the same coset, ie: $g_1, g_2 \in h + \langle f \rangle$, then each $g_i = h + p_if$, so $g_1 - g_2 = (p_1 - p_2)f$.

It can perhaps best be summarised by saying that $g_1$ and $g_2$ lie in the same coset if and only if this coset has the two different names, $g_1 + \langle f \rangle$ and $g_2 + \langle f \rangle$, as in the picture.

[Figure: a single coset containing $g_1$ and $g_2$, labelled both $g_1 + \langle f \rangle$ and $g_2 + \langle f \rangle$.]

4. The situation in the picture opposite never happens. If the two cosets pictured are called respectively $g_1 + \langle f \rangle$ and $g_2 + \langle f \rangle$, then $h$ is in both $g_1 + \langle f \rangle$ and $g_2 + \langle f \rangle$ and so differs from $g_1$ and $g_2$ by multiples of $f$, ie: $g_1 - h = p_1f$ and $h - g_2 = p_2f$, so that $g_1 - g_2 = (p_1 + p_2)f$. Since $g_1$ and $g_2$ differ by a multiple of $f$, we have $g_1 + \langle f \rangle = g_2 + \langle f \rangle$.

[Figure: two distinct cosets drawn overlapping, with an element $h$ in the intersection.]

Thus, the cosets of an ideal partition the ring.

(5.8) As an example of all these ideas, let $x^2 - 2 \in \mathbb{Q}[x]$ and consider the ideal

$$\langle x^2 - 2 \rangle = \{p(x^2 - 2) \mid p \in \mathbb{Q}[x]\}.$$

Certainly, $(x^3 - 2x + 15) + \langle x^2 - 2 \rangle$ is a coset, but is it written in the nicest possible form? If we divide by $x^2 - 2$:

$$x^3 - 2x + 15 = x(x^2 - 2) + 15,$$

we have that $x^3 - 2x + 15$ and 15 differ by a multiple of $x^2 - 2$. That gives

$$(x^3 - 2x + 15) + \langle x^2 - 2 \rangle = 15 + \langle x^2 - 2 \rangle.$$
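Putting a coset into its nicest form is just the division algorithm, so it can be automated. The following is a sketch under some assumptions of our own: polynomials over $\mathbb{Q}$ are stored as coefficient lists with the lowest degree first, the divisor has non-zero leading coefficient, and `poly_divmod` is a name invented for the illustration.

```python
from fractions import Fraction

def poly_divmod(h, f):
    """Divide h by f in Q[x]; coefficient lists, lowest degree first.

    Returns (q, r) with h = q*f + r and deg r < deg f.
    Assumes the last entry of f (its leading coefficient) is non-zero."""
    f = [Fraction(c) for c in f]
    r = [Fraction(c) for c in h]
    q = [Fraction(0)] * max(len(r) - len(f) + 1, 1)
    while len(r) >= len(f):
        shift = len(r) - len(f)
        coeff = r[-1] / f[-1]
        q[shift] = coeff
        for i, c in enumerate(f):
            r[shift + i] -= coeff * c      # cancel the current leading term
        while r and r[-1] == 0:
            r.pop()                         # drop leading zeros
    return q, r

# x^3 - 2x + 15 divided by x^2 - 2
q, r = poly_divmod([15, -2, 0, 1], [-2, 0, 1])
```

The remainder `r` is the preferred representative of the coset; an empty list stands for the zero polynomial, ie: for the coset $\langle f \rangle$ itself.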

(5.9) If we look again at the example of the ideal $\langle x \rangle$ in $\mathbb{F}_2[x]$, there were only two cosets,

$$\langle x \rangle = 0 + \langle x \rangle \quad\text{and}\quad 1 + \langle x \rangle,$$

that corresponded to the polynomials with constant term 0 and the polynomials with constant term 1 (these are the only possibilities for the coefficients in $\mathbb{F}_2[x]$!) We could try "adding" and "multiplying" these two cosets together according to,

$$(0 + \langle x \rangle) + (0 + \langle x \rangle) = 0 + \langle x \rangle, \quad (1 + \langle x \rangle) + (0 + \langle x \rangle) = 1 + \langle x \rangle, \quad (1 + \langle x \rangle) + (1 + \langle x \rangle) = 0 + \langle x \rangle,$$

and so on, where all we have done is to add the representatives of the cosets together using the addition from $\mathbb{F}_2$. Similarly for multiplying the cosets. This looks awfully like $\mathbb{F}_2$, but with $0 + \langle x \rangle$ and $1 + \langle x \rangle$ replacing 0 and 1.

(5.10) In fact this always happens. Let $\langle f \rangle$ be an ideal in $F[x]$, and define an addition and multiplication of cosets of $\langle f \rangle$ by,

$$(g_1 + \langle f \rangle) + (g_2 + \langle f \rangle) = (g_1 + g_2) + \langle f \rangle \quad\text{and}\quad (g_1 + \langle f \rangle)(g_2 + \langle f \rangle) = (g_1g_2) + \langle f \rangle,$$

where the addition and multiplication of the $g_i$'s is happening in $F[x]$.

Theorem 3 The set of cosets $F[x]/\langle f \rangle$ together with the $+$ and $\times$ above is a ring.

Call this the quotient ring of $F[x]$ by the ideal $\langle f \rangle$. All our rings have a "zero", a "one", and so on, and for the quotient ring these are,

element of a ring        corresponding element in $F[x]/\langle f \rangle$
$a$                      $g + \langle f \rangle$
$-a$                     $(-g) + \langle f \rangle$
$0$                      $0 + \langle f \rangle = \langle f \rangle$
$1$                      $1 + \langle f \rangle$

Exercise 48 To prove this theorem,

1. Show that the addition of cosets is well defined, ie: if $g_i' + \langle f \rangle = g_i + \langle f \rangle$ for $i = 1, 2$, then

$$(g_1' + g_2') + \langle f \rangle = (g_1 + g_2) + \langle f \rangle.$$

2. Similarly, show that the multiplication is well defined. Actually, it is because of this and the previous part that we can only take the quotients of polynomial rings by ideals, and not by just any old subring.

3. Now verify the axioms for a ring.

Notice that the quotient is a ring, but not necessarily a field. So the motivating example above, where the quotient turned out to be the field $\mathbb{F}_2$, was a little special.

(5.11) Let $x^2 + 1 \in \mathbb{R}[x]$, and look at the ideal $\langle x^2 + 1 \rangle$. We want to see what the quotient $\mathbb{R}[x]/\langle x^2 + 1 \rangle$ looks like. First, any coset can be put into a nice form: for example,

$$x^4 + x^2 + x + 1 + \langle x^2 + 1 \rangle = x^2(x^2 + 1) + (x + 1) + \langle x^2 + 1 \rangle,$$

where we have divided $x^4 + x^2 + x + 1$ by $x^2 + 1$ using the division algorithm. But

$$x^2(x^2 + 1) + (x + 1) + \langle x^2 + 1 \rangle = x + 1 + \langle x^2 + 1 \rangle,$$

as the multiple of $x^2 + 1$ gets absorbed into the ideal. In fact, for any $g \in \mathbb{R}[x]$ we can make this argument,

$$g + \langle x^2 + 1 \rangle = q(x^2 + 1) + (ax + b) + \langle x^2 + 1 \rangle = ax + b + \langle x^2 + 1 \rangle,$$

for some $a, b \in \mathbb{R}$, so the set of cosets can be written as

$$\mathbb{R}[x]/\langle x^2 + 1 \rangle = \{ax + b + \langle x^2 + 1 \rangle \mid a, b \in \mathbb{R}\}.$$

Now take two elements of the quotient, say $(x + 1) + \langle x^2 + 1 \rangle$ and $(2x - 3) + \langle x^2 + 1 \rangle$, and add/multiply them together:

$$\{(x + 1) + \langle x^2 + 1 \rangle\} + \{(2x - 3) + \langle x^2 + 1 \rangle\} = 3x - 2 + \langle x^2 + 1 \rangle,$$

and

$$\begin{aligned} \{(x + 1) + \langle x^2 + 1 \rangle\} \times \{(2x - 3) + \langle x^2 + 1 \rangle\} &= (2x^2 - x - 3) + \langle x^2 + 1 \rangle \\ &= 2(x^2 + 1) + (-x - 5) + \langle x^2 + 1 \rangle \\ &= -x - 5 + \langle x^2 + 1 \rangle. \end{aligned}$$

Now "squint" your eyes, so that the "$+\langle x^2 + 1 \rangle$" part in the above disappears, and $ax + b + \langle x^2 + 1 \rangle$ becomes the complex number $ai + b \in \mathbb{C}$. Then

$$(i + 1) + (2i - 3) = 3i - 2 \quad\text{and}\quad (i + 1)(2i - 3) = -i - 5.$$

The addition and multiplication of cosets in $\mathbb{R}[x]/\langle x^2 + 1 \rangle$ looks exactly like the addition and multiplication of complex numbers!

(5.12) In order to see what quotient rings really look like, you need to use the,

First Isomorphism Theorem. Let $R, S$ be rings and $\varphi : R \to S$ a ring homomorphism. Then

$$R/\ker\varphi \cong \operatorname{Im}\varphi \subseteq S,$$

where the isomorphism $\hat{\varphi} : R/\ker\varphi \to \operatorname{Im}\varphi$ is given by $\hat{\varphi}(r + \ker\varphi) = \varphi(r)$.

Aside. For any ring $R$ and homomorphism $\varphi$, $\ker\varphi$ is an ideal of $R$ in the more general sense mentioned at the beginning of the section. Thus it makes sense to take the quotient $R/\ker\varphi$.
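The "squinting" of (5.11) can be checked numerically. In the sketch below, which is only an illustration with invented helper names, a coset $ax + b + \langle x^2 + 1 \rangle$ is stored as the pair $(a, b)$, cosets are multiplied by replacing $x^2$ with $-1$, and the result is compared with Python's built-in complex arithmetic.

```python
def coset_mul(u, v):
    """Multiply (a*x + b) and (c*x + d) modulo x^2 + 1; cosets are pairs (a, b)."""
    (a, b), (c, d) = u, v
    # a*c*x^2 + (a*d + b*c)*x + b*d, and x^2 becomes -1 modulo x^2 + 1
    return (a * d + b * c, b * d - a * c)

def coset_add(u, v):
    return (u[0] + v[0], u[1] + v[1])

def to_complex(u):
    """Send the coset of a*x + b to the complex number a*i + b."""
    a, b = u
    return complex(b, a)

u, v = (1, 1), (2, -3)            # x + 1 and 2x - 3
product = coset_mul(u, v)         # coset of -x - 5
total = coset_add(u, v)           # coset of 3x - 2
```

That `to_complex` turns coset products into complex products is precisely the statement that it is a ring homomorphism, which is what the First Isomorphism Theorem packages up.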

(5.13) Getting back to the example above, let $R = \mathbb{R}[x]$ and $S = \mathbb{C}$. Let the homomorphism $\varphi$ be the evaluation at $i$ homomorphism,

$$\varepsilon_i : f \mapsto f(i).$$

In exactly the same way as an earlier example, one can show that

$$\ker\varepsilon_i = \langle x^2 + 1 \rangle.$$

On the other hand, if $ai + b \in \mathbb{C}$, then $ai + b = \varepsilon_i(ax + b)$, so the image of the homomorphism $\varepsilon_i$ is all of $\mathbb{C}$. Feeding all this into the first isomorphism theorem gives,

$$\mathbb{R}[x]/\langle x^2 + 1 \rangle \cong \mathbb{C}.$$

Exercise 49 Going back to the general case of an ideal $\mathfrak{a}$ in a ring $R$, consider the map $\eta : R \to R/\mathfrak{a}$ given by,

$$\eta(r) = r + \mathfrak{a},$$

sending an element of $R$ to the coset of $\mathfrak{a}$ determined by it.

1. Show that $\eta$ is a homomorphism.

2. Show that if $\mathfrak{b}$ is an ideal in $R$ containing $\mathfrak{a}$ then $\eta(\mathfrak{b})$ is an ideal of $R/\mathfrak{a}$.

3. Show that if $\mathfrak{b}'$ is an ideal of $R/\mathfrak{a}$ then there is an ideal $\mathfrak{b}$ of $R$, containing $\mathfrak{a}$, such that $\eta(\mathfrak{b}) = \mathfrak{b}'$.

4. Show that in this way, $\eta$ is a bijection between the ideals of $R$ containing $\mathfrak{a}$ and the ideals of $R/\mathfrak{a}$.

Further Exercises for §5.

Exercise 50 Let $\theta : R \to S$ be a ring homomorphism. Show that,

1. $\theta(0) = 0$ (hint: consider $\theta(0 + 0)$),

2. $\theta$ is injective (ie: 1 to 1) if and only if $\ker\theta = \{0\}$.

Exercise 51 Determine which of these maps are ring homomorphisms.

1. The map $\theta : \mathbb{Z} \to \mathbb{Z}$ given by $\theta(n) = 2n$.

2. The map $\theta : \mathbb{Z} \to \mathbb{Z}$ given by $\theta(n) = -n$.

3. The map $\theta : \mathbb{R} \to \mathbb{R}$ given by $\theta(x) = |x|$.

4. The map $\theta : \mathbb{C} \to \mathbb{C}$ given by $\theta(z) = \bar{z}$ (i.e. complex conjugation).

5. The map $\theta : \mathbb{C} \to \mathrm{Mat}_2(\mathbb{R})$ defined by $\theta(x + iy) = \begin{pmatrix} x & y \\ -y & x \end{pmatrix}$.

Exercise 52 Determine whether the following maps are ring homomorphisms.

1. $\theta : \mathbb{Z}[x] \to \mathbb{Z}$ given by $\theta(f(x)) = f(0)$.

2. $\theta : \mathbb{Z}[x] \to \mathbb{Z}$ given by $\theta(f(x)) = f(1)$.

3. $\theta : \mathbb{Z}[x] \to \mathbb{Z}[x]$ given by $\theta(f(x)) = f(-x)$.

4. $\theta : \mathbb{Z}[x] \to \mathbb{Z}$ given by $\theta(f(x)) = f(2)^2$.

Exercise 53 Let $\phi = (1 + \sqrt{5})/2$ (in fact the Golden Number).

1. Show that the kernel of the evaluation map $\varepsilon_\phi : \mathbb{Q}[x] \to \mathbb{C}$ (given by $\varepsilon_\phi(f) = f(\phi)$) is the ideal $\langle x^2 - x - 1 \rangle$.

2. Show that $\mathbb{Q}(\phi) = \{a + b\phi \mid a, b \in \mathbb{Q}\}$.

3. Show that $\mathbb{Q}(\phi)$ is the image in $\mathbb{C}$ of the map $\varepsilon_\phi$.

§6. Fields II: Constructions and more examples

(6.1) An ideal $\langle f \rangle$ is maximal if and only if $\langle f \rangle \subsetneq F[x]$ and the only ideals of $F[x]$ containing it are itself and the whole ring $F[x]$, ie:

$$\langle f \rangle \subseteq \mathfrak{a} \subseteq F[x],$$

with $\mathfrak{a}$ an ideal implies that $\mathfrak{a} = \langle f \rangle$ or $\mathfrak{a} = F[x]$.

(6.2) The principal result of this section is,

Theorem B (Constructing Fields). The quotient ring $F[x]/\langle f \rangle$ is a field if and only if $\langle f \rangle$ is a maximal ideal.

Proof: By Exercise 47, a commutative ring $R$ is a field if and only if the only ideals of $R$ are the trivial one $\{0\}$ and the whole ring $R$. Thus the quotient $F[x]/\langle f \rangle$ is a field if and only if its only ideals are the trivial one, consisting of the zero coset $\langle f \rangle$ alone, and the whole ring $F[x]/\langle f \rangle$. By Exercise 49, there is a one to one correspondence between the ideals of the quotient $F[x]/\langle f \rangle$ and the ideals of $F[x]$ that contain $\langle f \rangle$. Thus $F[x]/\langle f \rangle$ has only the two trivial ideals precisely when there are only two ideals of $F[x]$ containing $\langle f \rangle$, which is the same as saying that $\langle f \rangle$ is maximal. □

(6.3) Suppose now that $f$ is an irreducible polynomial over $F$, and let $\langle f \rangle \subseteq I \subseteq F[x]$ with $I$ an ideal. Then $I = \langle h \rangle$ giving $\langle f \rangle \subseteq \langle h \rangle$, and so $h$ divides $f$. Since $f$ is irreducible this means that $h$ must be either a constant $\lambda \in F$ or $\lambda f$, so that the ideal $I$ is either $\langle \lambda \rangle$ or $\langle \lambda f \rangle$. But $\langle \lambda f \rangle$ is just the same as the ideal $\langle f \rangle$. On the other hand, any polynomial $g$ can be written as a multiple of $\lambda$, just by setting $g = \lambda(\lambda^{-1}g)$, and so $\langle \lambda \rangle = F[x]$.

Thus, if $f$ is an irreducible polynomial, then the ideal $\langle f \rangle$ is a maximal one. Conversely, if $\langle f \rangle$ is maximal and $h$ divides $f$, then $\langle f \rangle \subseteq \langle h \rangle$, so that by maximality $\langle h \rangle = \langle f \rangle$ or $\langle h \rangle = F[x]$.

Exercise 54 Show that $\langle f \rangle = \langle h \rangle$ if and only if $h = \lambda f$ for some non-zero constant $\lambda \in F$. Similarly, $\langle h \rangle = F[x]$ if and only if $h = \lambda$ for some non-zero constant $\lambda$.

Thus, the ideal $\langle f \rangle$ is maximal precisely when $f$ is irreducible, giving,

Corollary. $F[x]/\langle f \rangle$ is a field if and only if $f$ is an irreducible polynomial over $F$.

(6.4) The polynomial $x^2 + 1$ is irreducible over the reals $\mathbb{R}$, so the quotient ring $\mathbb{R}[x]/\langle x^2 + 1 \rangle$ is a field.

(6.5) The polynomial $x^2 - 2x + 2$ has roots $1 \pm i$, hence is irreducible over $\mathbb{R}$, giving the field,

$$\mathbb{R}[x]/\langle x^2 - 2x + 2 \rangle.$$

Consider the evaluation map $\varepsilon_{1+i} : \mathbb{R}[x] \to \mathbb{C}$ given as usual by $\varepsilon_{1+i}(f) = f(1 + i)$. In exactly the same way as the example in §5., one can show that $\ker\varepsilon_{1+i} = \langle x^2 - 2x + 2 \rangle$. Moreover, $a + bi = \varepsilon_{1+i}(a - b + bx)$ so that the evaluation map is onto $\mathbb{C}$. Thus, by the first isomorphism theorem we get that,

$$\mathbb{R}[x]/\langle x^2 - 2x + 2 \rangle \cong \mathbb{C}.$$

What this means is that one can construct the complex numbers in the following (slightly non-standard) way: start with the reals $\mathbb{R}$, and define a new symbol, $\nabla$ say, which is defined by the property,

$$\nabla^2 = 2\nabla - 2.$$

Now consider all expressions of the form $c + d\nabla$ for $c, d \in \mathbb{R}$. Add and multiply two such expressions together as follows:

$$\begin{aligned} (c_1 + d_1\nabla) + (c_2 + d_2\nabla) &= (c_1 + c_2) + (d_1 + d_2)\nabla \\ (c_1 + d_1\nabla)(c_2 + d_2\nabla) &= c_1c_2 + (c_1d_2 + d_1c_2)\nabla + d_1d_2\nabla^2 \\ &= c_1c_2 + (c_1d_2 + d_1c_2)\nabla + d_1d_2(2\nabla - 2) \\ &= (c_1c_2 - 2d_1d_2) + (c_1d_2 + d_1c_2 + 2d_1d_2)\nabla. \end{aligned}$$

Exercise 55 By solving the equations $cx - 2dy = 1$ and $cy + dx + 2dy = 0$ for $x$ and $y$ in terms of $c$ and $d$, find the inverse of the element $c + d\nabla$.

Exercise 56 According to Exercise 18, if $f$ is irreducible over $\mathbb{R}$ then $f$ must be either quadratic or linear. Suppose that $f = ax^2 + bx + c$ is an irreducible quadratic over $\mathbb{R}$. Show that the field $\mathbb{R}[x]/\langle ax^2 + bx + c \rangle \cong \mathbb{C}$.
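The multiplication rule for the expressions $c + d\nabla$ of (6.5) can be compared against ordinary complex arithmetic by sending $\nabla$ to $1 + i$. This is a sketch only: the names `nabla_mul` and `realise` are invented here, and the sample values are arbitrary.

```python
def nabla_mul(u, v):
    """(c1 + d1*N)(c2 + d2*N) with N*N = 2N - 2; elements stored as pairs (c, d)."""
    (c1, d1), (c2, d2) = u, v
    return (c1 * c2 - 2 * d1 * d2, c1 * d2 + d1 * c2 + 2 * d1 * d2)

def realise(u):
    """Send c + d*N to the complex number c + d*(1 + i)."""
    c, d = u
    return c + d * (1 + 1j)

u, v = (3, -2), (1, 4)            # 3 - 2N and 1 + 4N
product = nabla_mul(u, v)
agrees = realise(product) == realise(u) * realise(v)
```

The root $1 - i$ would serve equally well in `realise`, since it satisfies the same rule $\nabla^2 = 2\nabla - 2$; this is a first glimpse of the symmetry that the later sections exploit.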

(6.6) We saw in §3. that the polynomial $x^4 + x + 1$ was irreducible over the field $\mathbb{F}_2$. Thus the quotient

$$\mathbb{F}_2[x]/\langle x^4 + x + 1 \rangle,$$

is a field. Each of its elements is a coset of the form $g + \langle x^4 + x + 1 \rangle$. Use the division algorithm, dividing $g$ by $x^4 + x + 1$ to get

$$g + \langle x^4 + x + 1 \rangle = q(x^4 + x + 1) + r + \langle x^4 + x + 1 \rangle = r + \langle x^4 + x + 1 \rangle,$$

where the remainder $r$ must be of the form $ax^3 + bx^2 + cx + d$, for $a, b, c, d \in \mathbb{F}_2$. Thus every element of the field has the form $ax^3 + bx^2 + cx + d + \langle x^4 + x + 1 \rangle$, of which there are at most 16 possibilities (2 choices for $a$, 2 choices for $b$, ...).

Indeed these 16 are all distinct, for if

$$a_1x^3 + b_1x^2 + c_1x + d_1 + \langle x^4 + x + 1 \rangle = a_2x^3 + b_2x^2 + c_2x + d_2 + \langle x^4 + x + 1 \rangle$$

then,

$$(a_1 - a_2)x^3 + (b_1 - b_2)x^2 + (c_1 - c_2)x + (d_1 - d_2) + \langle x^4 + x + 1 \rangle = \langle x^4 + x + 1 \rangle$$
$$\Leftrightarrow (a_1 - a_2)x^3 + (b_1 - b_2)x^2 + (c_1 - c_2)x + (d_1 - d_2) \in \langle x^4 + x + 1 \rangle.$$

Since the non-zero elements of the ideal are multiples of a degree four polynomial, they have degrees that are at least four. Thus the only way the cubic can be an element is if it is the zero polynomial. In particular, $a_1 - a_2 = \cdots = d_1 - d_2 = 0$ so the two cubics are the same, ie: distinct cubics give distinct cosets.
The upshot is that the quotient ring is a field with 16 elements.
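The count of 16 and the field property can both be confirmed by brute force. In this sketch a polynomial over $\mathbb{F}_2$ is stored as an integer whose bit $k$ is the coefficient of $x^k$ (an encoding chosen for convenience, not taken from the text), so that addition is XOR and `reduce` carries out the division algorithm by $x^4 + x + 1$.

```python
MODULUS = 0b10011   # x^4 + x + 1, bit k is the coefficient of x^k

def reduce(g):
    """Remainder of g on division by x^4 + x + 1 over F_2."""
    while g.bit_length() > 4:                       # degree of g is at least 4
        g ^= MODULUS << (g.bit_length() - 5)        # cancel the leading term
    return g

def mul(u, v):
    """Product of two cosets, given by representatives of degree at most 3."""
    prod = 0
    while v:
        if v & 1:
            prod ^= u
        u <<= 1
        v >>= 1
    return reduce(prod)

# Every polynomial of degree below 8 reduces to one of the 16 cubics
elements = {reduce(g) for g in range(256)}

# Search for the inverse of each non-zero element
inverses = {u: next(v for v in range(1, 16) if mul(u, v) == 1)
            for u in range(1, 16)}
```

If the modulus were reducible, the `next(...)` search would fail for some element with a `StopIteration`, so the dictionary being built at all reflects the irreducibility of $x^4 + x + 1$.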

(6.7) Returning to the general situation of a quotient $F[x]/\langle f \rangle$ by an irreducible polynomial $f$, the resulting field contains a copy of the original field $F$, obtained by taking the cosets $\lambda + \langle f \rangle$ for $\lambda \in F$.

Exercise 57 Show that the map $\lambda \mapsto \lambda + \langle f \rangle$ is an injective homomorphism $F \to F[x]/\langle f \rangle$, and so $F$ is isomorphic to its image in $F[x]/\langle f \rangle$.

Blurring the distinction between the original $F$ and this copy inside $F[x]/\langle f \rangle$, we get that $F \subseteq F[x]/\langle f \rangle$ is an extension of fields.

(6.8) Generalising the example of the field of order 16 above, if $\mathbb{F}_p$ is the finite field with $p$ elements and $f \in \mathbb{F}_p[x]$ is an irreducible polynomial of degree $d$, then the quotient $\mathbb{F}_p[x]/\langle f \rangle$ is a field containing $p^d$ elements. They have the form,

$$a_{d-1}x^{d-1} + \cdots + a_0 + \langle f \rangle,$$

where $f = b_dx^d + \cdots + b_1x + b_0$ and the $a_i \in \mathbb{F}_p$. Any two such are distinct by exactly the same argument as above. Letting $\alpha = x + \langle f \rangle$ and replacing $\mathbb{F}_p$ by its copy in $\mathbb{F}_p[x]/\langle f \rangle$ (ie: identifying $\lambda \in \mathbb{F}_p$ with $\lambda + \langle f \rangle \in \mathbb{F}_p[x]/\langle f \rangle$), we have,

$$\mathbb{F}_p[x]/\langle f \rangle = \{a_{d-1}\alpha^{d-1} + \cdots + a_0 \mid a_i \in \mathbb{F}_p\},$$

where two such expressions are added and multiplied like "polynomials" in $\alpha$. The only proviso is that since $f + \langle f \rangle = \langle f \rangle$, we have the "rule" $b_d\alpha^d + \cdots + b_1\alpha + b_0 = 0$, which allows us to remove any powers $\alpha^k$ with $k \geq d$ that occur in such expressions.
Call $\alpha$ a generator for the finite field.

(6.9) The polynomial $x^3 + x + 1$ is irreducible over the field $\mathbb{F}_2$ (it is a cubic and has no roots) so that

$$\mathbb{F}_2[x]/\langle x^3 + x + 1 \rangle,$$

is a field with $2^3 = 8$ elements of the form $F = \{a + b\alpha + c\alpha^2 \mid a, b, c \in \mathbb{F}_2\}$ subject to the rule $\alpha^3 + \alpha + 1 = 0$, ie: $\alpha^3 = \alpha + 1$. This is the field $F$ of order 8 in §4.

Exercise 58 Construct fields with exactly:

1. 125 elements;

2. 49 elements;

3. 81 elements;

4. 243 elements.

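(6.10) Theorem B and its Corollary clear up a little mystery that has lingered since the end of §4. Remember from there that the fields

$$\mathbb{Q}(\sqrt[3]{2}) \quad\text{and}\quad \mathbb{Q}(\beta), \qquad \beta = \frac{-\sqrt[3]{2} + \sqrt[3]{2}\sqrt{3}\,i}{2},$$

were different (that is, their elements were different), but isomorphic? The polynomial $x^3 - 2$ is irreducible over $\mathbb{Q}$, either by Eisenstein, or by observing that it is a cubic whose roots do not lie in $\mathbb{Q}$. Thus

$$\mathbb{Q}[x]/\langle x^3 - 2 \rangle$$

is an extension field of $\mathbb{Q}$. Consider the two evaluation homomorphisms $\varepsilon_{\sqrt[3]{2}} : \mathbb{Q}[x] \to \mathbb{C}$ and $\varepsilon_\beta : \mathbb{Q}[x] \to \mathbb{C}$ where $\beta$ is the complex number adjoined to $\mathbb{Q}$ in the second extension above. Since, and this is the key bit,

$$\sqrt[3]{2} \quad\text{and}\quad \beta$$

are both roots of the polynomial $x^3 - 2$, we can show in a similar manner to examples at the end of §5. that $\ker\varepsilon_{\sqrt[3]{2}} = \langle x^3 - 2 \rangle = \ker\varepsilon_\beta$. Thus,

$$\operatorname{Im}\varepsilon_{\sqrt[3]{2}} \;\cong\; \mathbb{Q}[x]/\ker\varepsilon_{\sqrt[3]{2}} \;=\; \mathbb{Q}[x]/\langle x^3 - 2 \rangle \;=\; \mathbb{Q}[x]/\ker\varepsilon_\beta \;\cong\; \operatorname{Im}\varepsilon_\beta,$$

where the two outer isomorphisms are given by the 1st Isomorphism Theorem.

We can see what the image of $\varepsilon_{\sqrt[3]{2}}$ must be by considering the diagram,

$$\begin{array}{ccc} a_nx^n + \cdots + a_1x + a_0 & \xrightarrow{\ \text{division algorithm}\ } & q(x^3 - 2) + (a + bx + cx^2) \\ \big\downarrow {\scriptstyle \varepsilon_{\sqrt[3]{2}}} & & \big\downarrow {\scriptstyle \varepsilon_{\sqrt[3]{2}}} \\ a_n(\sqrt[3]{2})^n + \cdots + a_1\sqrt[3]{2} + a_0 & = & a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2 \end{array}$$

The point is that $\varepsilon_{\sqrt[3]{2}}$ is a ring homomorphism, so that

$$\begin{aligned} \varepsilon_{\sqrt[3]{2}}(a_nx^n + \cdots + a_1x + a_0) &= \varepsilon_{\sqrt[3]{2}}(q(x^3 - 2) + (a + bx + cx^2)) \\ &= \varepsilon_{\sqrt[3]{2}}(q)\,\varepsilon_{\sqrt[3]{2}}(x^3 - 2) + \varepsilon_{\sqrt[3]{2}}(a + bx + cx^2) \\ &= \varepsilon_{\sqrt[3]{2}}(q) \cdot 0 + \varepsilon_{\sqrt[3]{2}}(a + bx + cx^2) = a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2. \end{aligned}$$

Pictures like the one above, where you travel two routes but end up in the same place are called commutative diagrams. The result of the argument provided by the diagram is that

$$\operatorname{Im}\varepsilon_{\sqrt[3]{2}} \subseteq \{a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2 \in \mathbb{C} \mid a, b, c \in \mathbb{Q}\} = \mathbb{Q}(\sqrt[3]{2}).$$

On the other hand, any complex number of the form $a + b\sqrt[3]{2} + c(\sqrt[3]{2})^2$ is the image of $a + bx + cx^2$. Thus $\operatorname{Im}\varepsilon_{\sqrt[3]{2}} = \mathbb{Q}(\sqrt[3]{2})$. Similarly one can show that $\operatorname{Im}\varepsilon_\beta = \mathbb{Q}(\beta)$. Filling this information into the first of the two diagrams above gives the claimed isomorphism between $\mathbb{Q}(\sqrt[3]{2})$ and $\mathbb{Q}(\beta)$:

$$\mathbb{Q}(\sqrt[3]{2}) \;\cong\; \mathbb{Q}[x]/\langle x^3 - 2 \rangle \;\cong\; \mathbb{Q}(\beta),$$

where $\mathbb{Q}[x]/\langle x^3 - 2 \rangle$ is the "abstract" field and $\mathbb{Q}(\sqrt[3]{2})$, $\mathbb{Q}(\beta)$ are its "concrete" realisations in $\mathbb{C}$.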

(6.11) A special place is reserved in number theory for those fields of the form $\mathbb{Q}[x]/(f)$, for $f$ an irreducible polynomial over the rationals $\mathbb{Q}$. Such a field is called a number field, and their detailed study is the subject of algebraic number theory.

Suppose that $\beta$ is a root of the polynomial $f$ and consider the subfield of $\mathbb{C}$ given by $\mathbb{Q}(\beta)$. The reasoning in the example above can be extended to show that the two fields $\mathbb{Q}[x]/(f)$ and $\mathbb{Q}(\beta)$ are isomorphic (see also Theorem D). Indeed, if $\{\beta_1, \ldots, \beta_n\}$ are the roots of $f$, then we have $n$ mutually isomorphic fields inside $\mathbb{C}$, namely $\mathbb{Q}(\beta_1), \ldots, \mathbb{Q}(\beta_n)$. The isomorphisms from $\mathbb{Q}[x]/(f)$ to each of these are called the Galois monomorphisms of the number field $\mathbb{Q}[x]/(f)$.

(6.12) Returning to some generality, the observation that the field $F[x]/(f)$ is an extension of $F$ has a far-reaching consequence that goes by the name of,

Kronecker's Theorem. Let $f$ be a polynomial in $F[x]$. Then there is an extension of $F$ containing a root of $f$.

Proof: If $f$ is not irreducible over $F$, then factorise as $f = gh$ with $g$ irreducible over $F$ and proceed as below but with $g$ instead of $f$. The result will be an extension containing a root of $g$, and hence of $f$. Thus we may suppose that $f$ is irreducible over $F$ and $f = a_n x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0$ with the $a_i \in F$. Replace $F$ by its isomorphic copy in the quotient $F[x]/(f)$, so that instead of $a_i$, we write $a_i + (f)$, ie,

$$f = (a_n + (f))x^n + (a_{n-1} + (f))x^{n-1} + \cdots + (a_1 + (f))x + (a_0 + (f)).$$

Consider the field $E = F[x]/(f)$, which is an extension of $F$, and the element $x + (f) \in E$. If we substitute $x + (f)$ into the polynomial then we perform all our arithmetic in $E$, ie: we perform the arithmetic of cosets, and bear in mind that the zero of this field is the coset $(f)$. Thus,

$$\begin{aligned}
f(x + (f)) &= (a_n + (f))(x + (f))^n + (a_{n-1} + (f))(x + (f))^{n-1} + \cdots + (a_1 + (f))(x + (f)) + (a_0 + (f)) \\
&= (a_n x^n + (f)) + (a_{n-1}x^{n-1} + (f)) + \cdots + (a_1 x + (f)) + (a_0 + (f)) \\
&= (a_n x^n + a_{n-1}x^{n-1} + \cdots + a_1 x + a_0) + (f) = f + (f) = (f),
\end{aligned}$$

which in the field $E$ translates as $f(\mu) = 0$ for $\mu = x + (f)$. □
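
The coset computation in the proof can be replayed mechanically. The sketch below works over $\mathbb{F}_3$ with the polynomial $1 + 2x + x^3$ of Exercise 61; the list representation of cosets (constant term first) and the function names `poly_mod`, `coset_mul` and `evaluate_at_coset` are assumptions made for this illustration.

```python
def poly_mod(coeffs, modulus, p):
    """Remainder of coeffs on division by a monic modulus over F_p.

    Polynomials are lists of coefficients, constant term first.
    """
    r = [c % p for c in coeffs]
    d = len(modulus) - 1
    while len(r) > d:
        lead = r.pop()              # leading coefficient being eliminated
        shift = len(r) - d
        for i in range(d):
            r[shift + i] = (r[shift + i] - lead * modulus[i]) % p
    return r + [0] * (d - len(r))


def coset_mul(u, v, modulus, p):
    """Product of two cosets, each given by a representative of degree < d."""
    prod = [0] * (len(u) + len(v) - 1)
    for i, a in enumerate(u):
        for j, b in enumerate(v):
            prod[i + j] = (prod[i + j] + a * b) % p
    return poly_mod(prod, modulus, p)


def evaluate_at_coset(f, mu, modulus, p):
    """Horner evaluation of f at the coset mu, with all arithmetic in F_p[x]/(modulus)."""
    d = len(modulus) - 1
    result = [0] * d
    for c in reversed(f):
        result = coset_mul(result, mu, modulus, p)
        result[0] = (result[0] + c) % p
    return result


p = 3
f = [1, 2, 0, 1]     # 1 + 2x + x^3 over F_3
mu = [0, 1, 0]       # the coset x + (f)
value = evaluate_at_coset(f, mu, f, p)
```

The result `value` is the zero coset `[0, 0, 0]`, which is the statement $f(\mu) = 0$ of the proof in this one instance.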

Corollary. Let $f$ be a polynomial in $F[x]$. Then there is an extension of $F$ that contains all the roots of $f$.

Proof: Repeat the process described in the proof of Kronecker's Theorem at most $\deg f$ times, until the desired field is obtained. □

Further Exercises for §6.

Exercise 59 Show that $x^4 + x^3 + x^2 + x + 1$ is irreducible over $\mathbb{F}_3$. How many elements does the resulting extension of $\mathbb{F}_3$ have?
Exercise 60 As linear polynomials are always irreducible, show that the field $F[x]/(ax + b)$ is isomorphic to $F$.
Exercise 61

1. Show that $1 + 2x + x^3 \in \mathbb{F}_3[x]$ is irreducible and hence that $F = \mathbb{F}_3[x]/(1 + 2x + x^3)$ is a field.

2. Show that every coset can be written uniquely in the form $(a + bx + cx^2) + (1 + 2x + x^3)$ with $a, b, c \in \mathbb{F}_3$.

3. Deduce that the field $F$ has exactly 27 elements.

Exercise 62 Find an irreducible polynomial $f(x)$ in $\mathbb{F}_5[x]$ of degree 2. Show that $\mathbb{F}_5[x]/(f(x))$ is a field with 25 elements.



Exercise 63

1. Show that the polynomial $x^3 - 3x + 6$ is irreducible over $\mathbb{Q}$.

2. Hence, or otherwise, if [definitions of $\alpha$, $\beta$ and $\omega$ missing from the source], prove that

(a) the fields $\mathbb{Q}(\alpha + \beta)$ and $\mathbb{Q}(\omega\alpha + \bar{\omega}\beta)$ are distinct (that is, their elements are different), but,

(b) $\mathbb{Q}(\alpha + \beta)$ and $\mathbb{Q}(\omega\alpha + \bar{\omega}\beta)$ are isomorphic.

You may assume that $\omega\alpha + \bar{\omega}\beta$ is not a real number.

7. Ruler and Compass constructions I



If we allow ourselves a slightly fanciful historical interlude, we can imagine that the earliest civilizations to embrace agriculture came up against the problem of subdividing arable land into portions to be worked. Thus the Babylonians for instance would have needed the basics of surveying at their disposal. The most basic surveying instruments are wooden pegs and rope, with which we can do two very basic things: two pegs can be set a distance apart and the rope stretched between them; alternatively, one of the pegs can be kept stationary and we can take the path traced by the other as we walk around keeping the rope stretched taut. In other words, we can draw a line through two points and draw a circle centered at one point and passing through the other.

(7.1) Instead of the Euphrates river valley, we work in the complex plane $\mathbb{C}$. We are thus able, given two numbers $z, w \in \mathbb{C}$, to draw a line through them using a straight edge, or to place one end of a compass at $z$, and draw the circle passing through $w$:



Notice that neither of these operations involves any "measuring". 

(7.2) With these two constructions in hand, we call a complex number $\zeta$ constructible iff there is a sequence of numbers

$$0,\ 1,\ i = \zeta_1,\ \zeta_2,\ \ldots,\ \zeta_n = \zeta,$$

with $\zeta_j$ obtained from earlier numbers in the sequence in one of the three following ways:

[Figure: the three ways of obtaining $\zeta_j$ from earlier numbers in the sequence]

In these pictures, $p$, $q$, $r$ and $s$ are all $< j$. Notice that we are given the three numbers $0, 1, i$ "for free", so that they are indisputably constructible. The reasoning is this: if you stand in a plane (not $\mathbb{R}^2$ or $\mathbb{C}$, but a plane without coordinates), then your position can be taken as 0; decree a direction to be the real axis and a distance along it to be length 1; construct the perpendicular bisector of the segment from $-1$ to $1$ (as in the next paragraph) and measure a unit distance along this new axis (in either direction) to get $i$.

(7.3) The basic two moves are a little restrictive for the purposes of determining which numbers are constructible. There are a number of other constructions though, that follow immediately from them. For instance, we can construct the perpendicular bisector of a segment $AB$ by the following three steps:

[Figure: three-step construction of the perpendicular bisector of $AB$]

The pictures are supposed to be self-explanatory, modulo the following conventions. A ray, centered at some point and tracing out a dotted circle, is obviously meant to describe usage of the compass. If the ray is marked $r$, as in the first two pictures above, this does not mean that the radius of the circle has been set to some length $r$, as we cannot do this. It merely means that in passing from the first picture to the second, the setting on the compass is kept the same.

We can convince ourselves that the construction works as follows: think of the set $S$ of points in $\mathbb{C}$ that are an equal distance from both $A$ and $B$. After a moment's thought, you see that this must be the perpendicular bisector of the line segment $AB$ that we are constructing. Lines are determined by any two of their points, so if we can find two points equidistant from $A$ and $B$, and we draw a line through them, this must be the set $S$ that we want (and hence the perpendicular bisector). But the intersections of the two circular arcs are clearly equidistant from $A$ and $B$, so we are done.

(7.4) As well as bisecting segments, we can bisect angles, ie: if two lines meet in some angle we can 
construct a third line meeting these in angles that are each half the original one: 




It is worth repeating that none of the angles in this picture can be measured. Nevertheless, the two new 
ones must each be half the old one. 



(7.5) Given a line and a point P not on it, we can construct a new line passing through P and perpendic- 
ular to the line. We describe this as "dropping a perpendicular from a point to a line": 



[Figure: dropping a perpendicular from $P$ to the line, using the perpendicular bisector of a segment $AB$ on the line]



(7.6) Given a line $l$ and a point $P$ not on it we can construct a new line through $P$ parallel to $l$:

[Figure: constructing the parallel to $l$ through $P$, using the perpendicular from $P$ to $l$ at $Q$, the points $A$ and $B$ on $l$, and the perpendicular bisector of $AB$ meeting the semicircle at $R$]

Perhaps some explanation wouldn't go amiss: the first step is to drop a perpendicular from $P$ to the line $l$, meeting it at the new point $Q$. Next, set your compass to the distance from $P$ to $Q$, and transfer this circular distance along the line to some point, drawing a semicircle that meets $l$ at the points $A$ and $B$. Construct the perpendicular bisector of the segment from $A$ to $B$, which meets the semicircle at the new point $R$. Finally, draw a line through the points $P$ and $R$. It should be obvious that $P$ and $R$ are equidistant from the line $l$, hence the line through them is parallel to $l$.



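(7.7) Here are some basic examples that show that the numbers

$$3, \quad \frac{3}{4} \quad\text{and}\quad \frac{1}{\sqrt{2}} + \frac{1}{\sqrt{2}}\,i$$

are constructible:

[Figure: constructions of the three numbers; the third uses the bisector of the right angle]

In the second example, we have bisected the segment from 0 to 1 and then the segment from $\frac{1}{2}$ to 1.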

(7.8) Looking at the construction of $\frac{3}{4}$ above, it is less clear how one might construct the number [...] or the golden ratio,

$$\phi = \frac{1 + \sqrt{5}}{2}.$$

Nevertheless, these numbers are constructible, and the reason is the first non-trivial fact about constructible numbers: they can be added, subtracted, multiplied and divided⁶. Defining $K$ to be the set of constructible numbers in $\mathbb{C}$, we have,

Theorem C. $K$ is a subfield of $\mathbb{C}$.

Proof: The proof proceeds in two steps: as it is easier to deal with real numbers rather than complex, we show that the theorem can be reduced to the real case, and then show that the real constructible numbers form a subfield of $\mathbb{R}$.

First observe that $\zeta \in K$ precisely when $\mathrm{Re}\,\zeta$ and $\mathrm{Im}\,\zeta$ are in $K$. For, if $\zeta \in K$ then dropping perpendiculars to the real and imaginary axes gives the numbers $\mathrm{Re}\,\zeta$ and $(\mathrm{Im}\,\zeta)i$, the second of which can be transferred to the real axis by drawing the circle centered at 0 passing through $(\mathrm{Im}\,\zeta)i$. On the other hand, if we have $\mathrm{Re}\,\zeta$ and $\mathrm{Im}\,\zeta$ on the real axis, then we have $(\mathrm{Im}\,\zeta)i$ too, and constructing a line through $\mathrm{Re}\,\zeta$ parallel to the imaginary axis and a line through $(\mathrm{Im}\,\zeta)i$ parallel to the real axis gives $\zeta$.

We now reduce the Theorem to the real case by showing that $K$ is a subfield of $\mathbb{C}$ if and only if $K \cap \mathbb{R}$ is a subfield of $\mathbb{R}$. As the intersection of two subfields of $\mathbb{C}$ is a subfield of $\mathbb{C}$, the "only if" case is immediate.

Suppose then that the real constructible numbers form a subfield of the reals. We show that $K$ is then a subfield of $\mathbb{C}$, for which we need to show that if $z, w$ are constructible complex numbers then so are $z + w$, $-z$, $zw$ and $1/z$. By the observation above we have that the real and imaginary parts of $z$ and $w$ are real constructible numbers. Then

$$\begin{aligned}
-z &= -\mathrm{Re}\,z - (\mathrm{Im}\,z)\,i, \\
z + w &= (\mathrm{Re}\,z + \mathrm{Re}\,w) + (\mathrm{Im}\,z + \mathrm{Im}\,w)\,i, \\
zw &= (\mathrm{Re}\,z\,\mathrm{Re}\,w - \mathrm{Im}\,z\,\mathrm{Im}\,w) + (\mathrm{Re}\,z\,\mathrm{Im}\,w + \mathrm{Im}\,z\,\mathrm{Re}\,w)\,i, \\
\frac{1}{z} &= \frac{\mathrm{Re}\,z}{(\mathrm{Re}\,z)^2 + (\mathrm{Im}\,z)^2} - \frac{\mathrm{Im}\,z}{(\mathrm{Re}\,z)^2 + (\mathrm{Im}\,z)^2}\,i.
\end{aligned}$$

As the real constructible numbers form a subfield of $\mathbb{R}$, hence are closed under the four basic field operations, the real and imaginary parts of each of these four complex numbers are constructible. Thus, the complex numbers themselves are constructible as their real and imaginary parts are.

⁶ In principle you can now throw away your calculator, and perform arithmetic operations with ruler and compass! This is not as far-fetched as it sounds, even if it is a little impractical. To compute $\cos x$ of a constructible number $x$, construct as many terms of the Taylor series,

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots$$

as you need (your calculator only ever gives you approximations anyway).

In light of this, it suffices to show that the real constructible numbers are a subfield of the reals, for which we need to show that if $a, b \in K \cap \mathbb{R}$ then so are $-a$, $a + b$, $ab$ and $1/a$.

1. $K \cap \mathbb{R}$ is closed under $+$ and $-$: The picture below left shows that if $a \in K \cap \mathbb{R}$ then so is $-a$.

[Figure: construction of $-a$ from $a$ (left) and of $a + b$ from $a$ and $b$ (right)]

Similarly, the two on the right give $a, b \in K \cap \mathbb{R} \Rightarrow a + b \in K \cap \mathbb{R}$.

2. $K \cap \mathbb{R}$ is closed under $\times$, as can be seen by following through the steps:

[Figure: construction of $ab$ from $a$ and $b$ using a parallel line]

Seeing that the construction works involves studying a pair of similar triangles.

3. $K \cap \mathbb{R}$ is closed under $\div$, which is of course just the previous construction backwards:

[Figure: construction of $1/a$ from $a$]

□

(7.9) As any subfield of C contains Q we thus have the, 
Corollary. Any rational number is constructible. 
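
The reduction in the proof of Theorem C, from complex numbers to their real and imaginary parts, can be checked with exact rational arithmetic. This is only a sketch: rationals stand in for real constructible numbers, a complex number is represented as a pair (real part, imaginary part), and the function names are chosen for this illustration.

```python
from fractions import Fraction


def negate(z):
    """-z from the real and imaginary parts of z."""
    re, im = z
    return (-re, -im)


def add(z, w):
    """z + w computed coordinate by coordinate."""
    return (z[0] + w[0], z[1] + w[1])


def multiply(z, w):
    """zw = (ac - bd) + (ad + bc)i for z = a + bi and w = c + di."""
    (a, b), (c, d) = z, w
    return (a * c - b * d, a * d + b * c)


def reciprocal(z):
    """1/z for non-zero z, using only the four real field operations."""
    a, b = z
    norm = a * a + b * b          # (Re z)^2 + (Im z)^2
    return (a / norm, -b / norm)


z = (Fraction(3, 4), Fraction(-2, 5))
w = (Fraction(1, 3), Fraction(7, 2))
one = (Fraction(1), Fraction(0))
zero = (Fraction(0), Fraction(0))
```

Each function uses nothing beyond $+$, $-$, $\times$ and $\div$ on real numbers, which is the point of the reduction: once $K \cap \mathbb{R}$ is closed under these operations, so is $K$.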

(7.10) Not only can we perform the four basic arithmetic operations with constructible numbers, but we 
can extract square roots too: 

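Theorem 4 If $\zeta \in K$ then $\sqrt{\zeta} \in K$.

Proof: First of all, we can construct the square root of any real number:

[Figure: construction of the square root of a real number, using the midpoint of $OP$]

For a justification that this works, see Exercise 64. Next, any complex number:

[Figure: construction of the square root of a complex number, using the bisector of its angle]

where we have used the construction of real square roots in the second step. □

Exercise 64 Show that in the following picture,

[Figure: picture for Exercise 64]

the length $x = \sqrt{a}$.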
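
The two ingredients of the proof of Theorem 4, bisecting the angle and taking a real square root, have a direct numerical counterpart. The sketch below is an illustration only: it uses the half-angle formulas as a stand-in for the geometric bisection, works in floating point, and the name `complex_sqrt` is an assumption of this example.

```python
import math


def complex_sqrt(re, im):
    """A square root of re + im*i using only field operations and
    square roots of non-negative real numbers."""
    modulus = math.sqrt(re * re + im * im)       # real square root
    if modulus == 0:
        return (0.0, 0.0)
    cos_theta = re / modulus
    # Half-angle formulas: the algebraic form of bisecting the angle theta.
    # max(0.0, ...) guards against tiny negative values from rounding.
    cos_half = math.sqrt(max(0.0, (1 + cos_theta) / 2))
    sin_half = math.sqrt(max(0.0, (1 - cos_theta) / 2))
    if im < 0:
        sin_half = -sin_half
    root_modulus = math.sqrt(modulus)            # real square root again
    return (root_modulus * cos_half, root_modulus * sin_half)


root = complex_sqrt(3.0, 4.0)
```

For $3 + 4i$ this gives, up to rounding, the root $2 + i$. Only real square roots and field operations appear, matching the operations shown to be available with ruler and compass.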

Constructing angles and polygons 
(7.11) We say that an angle can be constructed when we can construct two lines intersecting in that angle. 

Exercise 65

1. Show that we can always assume that one of the lines giving an angle is the positive real axis.

2. Show that an angle $\theta$ can be constructed if and only if the number $\cos\theta$ can be constructed. Do the same for $\sin\theta$ and $\tan\theta$.

Exercise 66 Show that if $\varphi, \theta$ are constructible angles then so are $\varphi + \theta$ and $\varphi - \theta$.

(7.12) A regular $n$-sided polygon or regular $n$-gon is a polygon in $\mathbb{C}$ with $n$ sides of equal length and $n$ interior angles of equal size.

Exercise 67 Show that a regular $n$-gon can be constructed centered at $0 \in \mathbb{C}$ if and only if the angle $\frac{2\pi}{n}$ can be constructed. Show that a regular $n$-gon can be constructed centered at $0 \in \mathbb{C}$ if and only if the complex number

$$\zeta = \cos\frac{2\pi}{n} + i\sin\frac{2\pi}{n},$$

can be constructed.

Exercise 68 Show that if an $n$-gon and an $m$-gon can be constructed for $n$ and $m$ relatively prime, then so can an $mn$-gon (hint: use the $\mathbb{Z}$-version of Theorem 1).

(7.13) For what $n$ can one construct a regular $n$-gon? It makes sense to consider first the $p$-gons for $p$ a prime. The complete answer even to this question will not be revealed until §15. It turns out that the $p$-gons that can be constructed are extremely rare. Nevertheless, the first two (odd) primes do work.

Exercise 69 Show that a regular 3-gon, ie: an equilateral triangle, can be constructed with any side length. Using Exercises 4 and 67, show that a regular 5-gon can also be constructed.

(7.14) Here is a "proof" that a regular 17-gon is constructible. Gauss proved the following remarkable identity, which is still found in trigonometric tables:

$$\cos\frac{\pi}{17} = \frac{1}{8}\sqrt{2\left(2\sqrt{\sqrt{\frac{17(17 - \sqrt{17})}{2}} - \sqrt{\frac{17 - \sqrt{17}}{2}} - 4\sqrt{34 + 2\sqrt{17}} + 3\sqrt{17} + 17} + \sqrt{34 - 2\sqrt{17}} + \sqrt{17} + 15\right)}.$$

Thus the number $\cos\pi/17$ can be constructed as this expression involves only integers, the four field operations and square roots, all of which are operations we can perform with a ruler and compass. Hence, by Exercise 65(2) the angle $\pi/17$ can be constructed and so adding it to itself gives the angle $2\pi/17$. Exercise 67 then gives that the 17-gon is constructible.
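
An identity like this one is easy to mistype, so a numerical sanity check is worthwhile. The sketch below evaluates the nested radical for $\cos\pi/17$ in floating point and compares it with the library cosine; it is a check of the formula, not a proof, and the variable names are chosen for this illustration.

```python
import math

s17 = math.sqrt(17)

# The expression under the inner square root of the identity.
inner = (math.sqrt(17 * (17 - s17) / 2)
         - math.sqrt((17 - s17) / 2)
         - 4 * math.sqrt(34 + 2 * s17)
         + 3 * s17 + 17)

# The full nested radical: only integers, field operations and square roots.
radical_value = math.sqrt(
    2 * (2 * math.sqrt(inner) + math.sqrt(34 - 2 * s17) + s17 + 15)
) / 8

difference = abs(radical_value - math.cos(math.pi / 17))
```

The value of `difference` is at the level of floating-point rounding error.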

Further Exercises for §7.

Exercise 70 Using the fact that the constructible numbers include $\mathbb{Q}$, show that any given line segment can be trisected in length.

Exercise 71 Show that if you can construct a regular $n$-sided polygon, then you can also construct a regular $2^k n$-sided polygon for any $k \geq 1$.

Exercise 72 Show that $\cos\theta$ is constructible if and only if $\sin\theta$ is.

Exercise 73 If $a$, $b$ and $c$ are constructible numbers (ie: in $K$), show that the roots of the quadratic equation $ax^2 + bx + c = 0$ are also constructible.

8. Linear Algebra I: Dimensions

We have met rings and fields so far in our study of Galois Theory. Time for our third algebraic object: vector spaces.

(8.1) A vector space over a field $F$ is a set $V$ of vectors together with two operations: addition $u, v \mapsto u + v$ of vectors and scalar multiplication $\lambda, v \mapsto \lambda v$ of a vector by an element $\lambda$ of the field $F$, such that,

1. $(u + v) + w = u + (v + w)$, for all $u, v, w \in V$;

2. There exists a zero vector $0 \in V$ s.t. $v + 0 = v = 0 + v$ for all $v \in V$;

3. Every $v \in V$ has a negative $-v$ s.t. $v + (-v) = 0 = -v + v$;

4. $u + v = v + u$, for all $u, v \in V$;

5. $\lambda(u + v) = \lambda u + \lambda v$, for all $u, v \in V$ and $\lambda \in F$;

6. $(\lambda + \mu)v = \lambda v + \mu v$, for all $v \in V$ and $\lambda, \mu \in F$;

7. $\lambda(\mu v) = (\lambda\mu)v$, for all $v \in V$ and $\lambda, \mu \in F$;

8. $1v = v$ for $1 \in F$ and all $v \in V$.





Aside. Alternatively we can say that the set V of vectors forms an Abelian group under + (these are the first four axioms) together 
with the scalar multiplication which satisfies the last four axioms. 

(8.2) The set $\mathbb{R}^2$ of $2 \times 1$ column vectors is a well known real vector space under the normal addition and scalar multiplication of vectors. Alternatively, the complex numbers $\mathbb{C}$ form a vector space over $\mathbb{R}$, and of course these two spaces are the same space after making the identification,

$$\begin{pmatrix} a \\ b \end{pmatrix} \leftrightarrow a + bi.$$

The complex numbers also form a vector space over themselves: addition of complex numbers gives an 
Abelian group and now we can scalar multiply a complex number by another one, just using the usual 
multiplication of complex numbers. It may seem a little confusing (especially the idea of thinking of a 
complex number as being both a vector and a scalar) but from a purely formal point of view, it satisfies 
the axioms and so is an admissible example. As we shall see below, the idea of thinking of the same set 
of objects as a vector space over two different fields is an important one for Galois Theory. 



(8.3) We can consider vector spaces over finite fields too: Consider the set of all 3-tuples where the coordinates come from the field $\mathbb{F}_2$, so are either 0 or 1, and add two such coordinate-wise, using the addition from $\mathbb{F}_2$. Scalar multiply a tuple coordinate-wise using the multiplication from $\mathbb{F}_2$. As there are only two possibilities for each coordinate and three coordinates in total, we get a total of $2^3 = 8$ elements in this space. Indeed, the elements can be arranged around the vertices of a cube, where we abbreviate so that $abc$ is the vector with the three coordinates $a, b, c \in \mathbb{F}_2$.




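(8.4) We saw in §4. that the field $\mathbb{Q}(\sqrt{2})$ consisted precisely of those elements of $\mathbb{C}$ of the form $a + b\sqrt{2}$ for $a, b \in \mathbb{Q}$. By making the identification,

$$a + b\sqrt{2} \;\leftrightarrow\; \begin{pmatrix} a \\ b \end{pmatrix},$$

where $a$ is the coordinate in the "1 direction" and $b$ is the coordinate in the "$\sqrt{2}$ direction", we realise $\mathbb{Q}(\sqrt{2})$ as a vector space over $\mathbb{Q}$. It is easy to check that the vector space operations match up with $(a + b\sqrt{2}) + (c + d\sqrt{2}) = (a + c) + (b + d)\sqrt{2}$ corresponding to,

$$\begin{pmatrix} a \\ b \end{pmatrix} + \begin{pmatrix} c \\ d \end{pmatrix} = \begin{pmatrix} a + c \\ b + d \end{pmatrix},$$

and $c(a + b\sqrt{2}) = ac + bc\sqrt{2}$ corresponding to,

$$c\begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} ac \\ bc \end{pmatrix}.$$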


(8.5) The polynomial $x^3 - 2$ is irreducible over $\mathbb{Q}$ so that $\mathbb{Q}[x]/(x^3 - 2)$ is a field with elements having the form $(a + bx + cx^2) + (x^3 - 2)$. It becomes a $\mathbb{Q}$-vector space under the identification,

$$(a + bx + cx^2) + (x^3 - 2) \;\leftrightarrow\; \begin{pmatrix} a \\ b \\ c \end{pmatrix},$$

where $a$ is the coordinate in the "$1 + (x^3 - 2)$ direction", $b$ the coordinate in the "$x + (x^3 - 2)$ direction" and $c$ the coordinate in the "$x^2 + (x^3 - 2)$ direction".

(Check for yourself that the addition and scalar multiplications match up).

(8.6) The previous two are special cases of the following situation: if $F \subseteq E$ is an extension of fields then $E$ can be turned into a vector space over $F$ in the following way: the "vectors" are the elements of $E$ and the scalars are obviously the elements of $F$. Addition of vectors is just the addition of elements in $E$, and to scalar multiply a $v \in E$ by a $\lambda \in F$, just multiply $\lambda v$ using the multiplication of the field $E$. That the first four axioms for a vector space hold follows from the addition of the field $E$, and the second four from the multiplication of the field $E$.

(8.7) Some more fundamental notions to do with vector spaces: for $v_1, \ldots, v_n \in V$ vectors, any vector of the form

$$\alpha_1 v_1 + \cdots + \alpha_n v_n,$$

for $\alpha_1, \ldots, \alpha_n \in F$, is a linear combination of the $v_1, \ldots, v_n$. The linear span of $v_1, \ldots, v_n \in V$ is the set of all linear combinations of these vectors,

$$\mathrm{span}\{v_1, \ldots, v_n\} = \Big\{\sum_j \alpha_j v_j \;\Big|\; \alpha_j \in F\Big\}.$$

Say $v_1, \ldots, v_n$ span $V$ when $V = \mathrm{span}\{v_1, \ldots, v_n\}$.

A set of vectors $v_1, \ldots, v_n \in V$ is linearly dependent if and only if there exist scalars $\alpha_1, \ldots, \alpha_n$, not all zero, s.t.

$$\alpha_1 v_1 + \cdots + \alpha_n v_n = 0.$$

The vectors $v_1, \ldots, v_n$ are linearly independent otherwise, ie: whenever $\alpha_1 v_1 + \cdots + \alpha_n v_n = 0$ implies that the $\alpha_i$ are all 0.

(8.8) In the examples above, the complex numbers $\mathbb{C}$ are spanned, as a vector space over $\mathbb{R}$, by the two elements $\{1, i\}$, and indeed by any two complex numbers that are not real scalar multiples of each other. As a vector space over $\mathbb{C}$, the complex numbers are spanned by just one element, for example, any element $\zeta \in \mathbb{C}$ can be written as $\zeta \times 1$, so that every element is a complex scalar multiple of 1. Indeed, $\mathbb{C}$ is spanned as a complex vector space by any single one of its elements, except for 0. The moral is that in changing the field of scalars, you need to keep your wits about you.

(8.9) A basis for $V$ is a linearly independent set $\{v_j : j \in J\}$ (here $J$ is a not necessarily finite index set), that spans $V$. We say $V$ is finite dimensional if it has a finite basis.

It can be proved that there is a 1-1 correspondence between the elements of any two bases for a vector space $V$. Correspondingly, whenever $V$ is finite dimensional we define the dimension of $V$ to be $\dim(V)$ = number of elements in any basis.

(8.10) Thus $\mathbb{C}$ is 2-dimensional as a vector space over $\mathbb{R}$ but 1-dimensional as a vector space over $\mathbb{C}$. We will see later in this section that $\mathbb{C}$ is infinite dimensional as a vector space over $\mathbb{Q}$.

With the other examples above, $\mathbb{Q}(\sqrt{2})$ is 2-dimensional over $\mathbb{Q}$ with basis $\{1, \sqrt{2}\}$ and $\mathbb{Q}[x]/(x^3 - 2)$ is 3-dimensional over $\mathbb{Q}$ with basis the cosets

$$1 + (x^3 - 2), \quad x + (x^3 - 2) \quad\text{and}\quad x^2 + (x^3 - 2).$$

In Exercise 128 in §14., we will see that if $\alpha = \sqrt[4]{2}$, then $\mathbb{Q}(\alpha, i)$ is a 2-dimensional space over $\mathbb{Q}(\alpha)$ or $\mathbb{Q}(\alpha i)$ or even $\mathbb{Q}((1 + i)\alpha)$; a 4-dimensional space over $\mathbb{Q}(i)$ or $\mathbb{Q}(i\alpha^2)$, and an 8-dimensional space over $\mathbb{Q}$ (and these are almost, but not quite, all the possibilities; see the exercise for the full story).

(8.11) Vector spaces, like groups, rings and fields, are algebraic objects, and so like these other examples, there is a notion of a homomorphism of vector spaces. This is a map $\varphi : V_1 \to V_2$ that preserves any operations we may have, which in the case of vector spaces is the addition and scalar multiplications:

$$\varphi(u + v) = \varphi(u) + \varphi(v), \qquad \varphi(\lambda v) = \lambda\varphi(v) \quad\text{for } u, v \in V_1 \text{ and } \lambda \in F.$$

For historical reasons, homomorphisms of vector spaces are more commonly called linear maps.

Aside. Although we don't need these concepts here, there is an algebraic theory of vector spaces akin to that for groups, rings and fields. For example, linear maps have kernels, there are quotients of vector spaces, and so on. There is a first isomorphism theorem for vector spaces, which as usual reads as $V/\ker\varphi \cong \mathrm{image}\,\varphi$. We get things like $\dim(V_1/V_2) = \dim(V_1) - \dim(V_2)$, so in particular for a linear map,

$$\dim(V) = \dim(\ker\varphi) + \dim(\mathrm{image}\,\varphi).$$

In the linear theory of vector spaces (rather than the algebraic theory), $\dim(\mathrm{image}\,\varphi)$ is called the rank, $\dim(\ker\varphi)$ the nullity and $\dim(V)$ the number of columns of a matrix. So the first isomorphism theorem for vector spaces translates into the mantra, "rank + nullity = the number of columns".

(8.12) Let $F \subseteq E$ be an extension of fields. Consider $E$ as a vector space over $F$, and define the degree of the extension to be the dimension of this vector space, denoted $[E : F]$. Call $F \subseteq E$ a finite extension if the degree is finite.

(8.13) The extensions $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2})$ and $\mathbb{Q} \subseteq \mathbb{Q}[x]/(x^3 - 2)$ have degrees 2 and 3.

(8.14) It is no coincidence that the degree of an extension of the form $F \subseteq F[x]/(f)$ turns out to be the same as the degree of the polynomial $f$, as the next result shows.

Theorem 5 Let $f$ be an irreducible polynomial in $F[x]$ of degree $d$. Then the extension,

$$F \subseteq F[x]/(f),$$

has degree $d$.

Hence the name degree!

Proof: Replace, as usual, the field $F$ by its copy in $F[x]/(f)$, so that $\lambda \in F$ becomes $\lambda + (f)$. Consider the set of cosets,

$$B = \{1 + (f),\ x + (f),\ x^2 + (f),\ \ldots,\ x^{d-1} + (f)\}.$$

Then $B$ is a basis for $F[x]/(f)$ over $F$, for which we have to show that it spans the field/vector space and is linearly independent. To see that it spans, consider a typical element, which has the form,

$$g + (f) = (qf + r) + (f) = r + (f) = (a_0 + a_1 x + \cdots + a_{d-1}x^{d-1}) + (f),$$

using the division algorithm and basic properties of cosets. This in turn gives,

$$(a_0 + a_1 x + \cdots + a_{d-1}x^{d-1}) + (f) = (a_0 + (f))(1 + (f)) + (a_1 + (f))(x + (f)) + \cdots + (a_{d-1} + (f))(x^{d-1} + (f)),$$

where the last is an $F$-linear combination of the elements of $B$. Thus this set spans the space.

For linear independence, suppose we have an $F$-linear combination of the elements of $B$ giving zero,

$$(b_0 + (f))(1 + (f)) + (b_1 + (f))(x + (f)) + \cdots + (b_{d-1} + (f))(x^{d-1} + (f)) = (f),$$

remembering that the zero of the field $F[x]/(f)$ is the coset $0 + (f) = (f)$. Multiplying and adding all the cosets on the left hand side gives,

$$(b_0 + b_1 x + \cdots + b_{d-1}x^{d-1}) + (f) = (f),$$

so that $b_0 + b_1 x + \cdots + b_{d-1}x^{d-1} \in (f)$ (using another basic property of cosets). The elements of $(f)$, being multiples of $f$, must have degree at least $d$, except for the zero polynomial. On the other hand $b_0 + b_1 x + \cdots + b_{d-1}x^{d-1}$ has degree $\leq d - 1$. Thus it must be the zero polynomial, giving that all the $b_i$ are zero, and that the set $B$ is linearly independent over $F$ as claimed. □
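
As a small worked instance of the spanning argument (the choice of element is ours, purely for illustration), take $F = \mathbb{Q}$, $f = x^3 - 2$ and the coset of $g = x^4 + x^3$. The division algorithm and the basis $B$ of Theorem 5 give its coordinates:

```latex
\begin{aligned}
x^4 + x^3 &= (x + 1)(x^3 - 2) + (2 + 2x), \\
(x^4 + x^3) + (f) &= (2 + 2x) + (f) \\
  &= 2\,\bigl(1 + (f)\bigr) + 2\,\bigl(x + (f)\bigr) + 0\,\bigl(x^2 + (f)\bigr).
\end{aligned}
```

So in the basis $\{1 + (f),\ x + (f),\ x^2 + (f)\}$ this element has coordinates $(2, 2, 0)$. Equivalently, one applies the rule $x^3 = 2$ in the quotient: $x^4 + x^3$ becomes $2x + 2$.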

(8.15) What is the degree of the extension $\mathbb{Q} \subseteq \mathbb{Q}(\pi)$? If it was finite, say $[\mathbb{Q}(\pi) : \mathbb{Q}] = d$, then any collection of more than $d$ elements would be linearly dependent. In particular, the $d + 1$ elements,

$$1, \pi, \pi^2, \ldots, \pi^d,$$

are dependent over $\mathbb{Q}$, so that $a_0 + a_1\pi + a_2\pi^2 + \cdots + a_d\pi^d = 0$ for some $a_0, a_1, \ldots, a_d \in \mathbb{Q}$, not all zero, and so $\pi$ is a root of the polynomial $a_0 + a_1 x + a_2 x^2 + \cdots + a_d x^d$, which contradicts $\pi$ being transcendental over $\mathbb{Q}$. Thus, the degree of the extension is infinite, and so for $\mathbb{Q} \subseteq \mathbb{Q}(\alpha)$ to be finite, we clearly cannot have that $\alpha$ is transcendental over $\mathbb{Q}$.

(8.16) In fact this is always true:

Proposition 3 Let $F \subseteq E$ and $\alpha \in E$. If the extension $F \subseteq F(\alpha)$ is finite, then $\alpha$ is algebraic over $F$.

Proof: The proof is very similar to the example above. Suppose that the extension $F \subseteq F(\alpha)$ has degree $n$, so that any collection of $n + 1$ elements of $F(\alpha)$ must be linearly dependent. In particular the $n + 1$ elements

$$1, \alpha, \alpha^2, \ldots, \alpha^n$$

are dependent over $F$, so that there are $a_0, a_1, \ldots, a_n$ in $F$, not all zero, with

$$a_0 + a_1\alpha + \cdots + a_n\alpha^n = 0,$$

giving that $\alpha$ is algebraic over $F$ as claimed. □

Thus, any field $E$ that contains transcendentals over $F$ will be infinite dimensional over $F$. In particular, $\mathbb{R}$ and $\mathbb{C}$ are infinite dimensional over $\mathbb{Q}$.

(8.17) The converse to Proposition 3 is partly true, as we summarise now in an important result.

Theorem D (Complete Description of Simple Extensions). Let $F \subseteq E$ and $\alpha \in E$ be algebraic over $F$. Then,

1. There is a unique polynomial $f \in F[x]$ that is monic, irreducible over $F$, and has $\alpha$ as a root;

2. The simple extension $F(\alpha)$ is isomorphic to the quotient $F[x]/(f)$;

3. If $\deg f = d$, then the extension $F \subseteq F(\alpha)$ has degree $d$ with basis $\{1, \alpha, \alpha^2, \ldots, \alpha^{d-1}\}$, and so,

$$F(\alpha) = \{a_0 + a_1\alpha + a_2\alpha^2 + \cdots + a_{d-1}\alpha^{d-1} \mid a_0, \ldots, a_{d-1} \in F\}.$$



Proof: Hopefully most of the proof will be recognisable from the specific examples we have discussed already. As $\alpha$ is algebraic over $F$ there is at least one $F$-polynomial having $\alpha$ as a root. Choose $f'$ to be a non-zero one having smallest degree. This polynomial must then be irreducible over $F$, for if not, we have $f' = gh$ with $\deg(g), \deg(h) < \deg(f')$, and $\alpha$ must be a root of one of $g$ or $h$, contradicting the original choice of $f'$. Divide through by the leading coefficient of $f'$, to get $f$, a monic, irreducible (by Exercise 17) $F$-polynomial, having $\alpha$ as a root. If $f_1, f_2$ are polynomials with these properties then $f_1 - f_2$ has degree strictly less than either $f_1$ or $f_2$ and still has $\alpha$ as a root, so the only possibility is that $f_1 - f_2$ is zero, hence $f$ is unique.

Consider the evaluation homomorphism $\varepsilon_\alpha : F[x] \to E$ defined as usual by $\varepsilon_\alpha(g) = g(\alpha)$. To show that the kernel of this homomorphism is the ideal $(f)$ is completely analogous to the example at the beginning of Section §5.: clearly $(f)$ is contained in the kernel, as any multiple of $f$ must evaluate to zero when $\alpha$ is substituted into it. On the other hand, if $h$ is in the kernel of $\varepsilon_\alpha$, then by the division algorithm,

$$h = qf + r,$$

with $\deg(r) < \deg(f)$. Finding the $\varepsilon_\alpha$ image of both sides gives $0 = \varepsilon_\alpha(h) = \varepsilon_\alpha(qf) + \varepsilon_\alpha(r) = \varepsilon_\alpha(r)$, so that $r$ has $\alpha$ as a root. As $f$ is minimal with this property, we must have that $r = 0$, so that $h = qf$, ie: $h$ is in the ideal $(f)$, and so the kernel is contained in this ideal. Thus, $\ker\varepsilon_\alpha = (f)$.
In particular we have an isomorphism $\widehat{\varepsilon}_\alpha : F[x]/(f) \to \operatorname{Im}\varepsilon_\alpha$, given by,

$$\widehat{\varepsilon}_\alpha(g + (f)) = \varepsilon_\alpha(g) = g(\alpha),$$

with the left hand side a field as $f$ is irreducible over $F$. Thus, $\operatorname{Im}\varepsilon_\alpha$ is a subfield of $E$. Clearly, both the element $\alpha$ ($\varepsilon_\alpha(x) = \alpha$) and the field $F$ ($\varepsilon_\alpha(\lambda) = \lambda$) are contained in $\operatorname{Im}\varepsilon_\alpha$, hence $F(\alpha)$ is too, as $\operatorname{Im}\varepsilon_\alpha$ is a subfield of $E$ and $F(\alpha)$ is the smallest one enjoying these two properties. Conversely, if $g = \sum a_i x^i \in F[x]$ then $\varepsilon_\alpha(g) = \sum a_i\alpha^i$, which is an element of $F(\alpha)$ as fields are closed under sums and products. Hence $\operatorname{Im}\varepsilon_\alpha \subseteq F(\alpha)$ and so these two are the same. Thus $\widehat{\varepsilon}_\alpha$ is an isomorphism between $F[x]/(f)$ and $F(\alpha)$.

The final part follows immediately from Theorem 5, where we showed that the set of cosets

$$\{1 + (f),\ x + (f),\ x^2 + (f),\ \ldots,\ x^{d-1} + (f)\},$$

formed a basis for $F[x]/(f)$ over $F$. Their images under $\widehat{\varepsilon}_\alpha$, namely $\{1, \alpha, \alpha^2, \ldots, \alpha^{d-1}\}$, must then form a basis for $F(\alpha)$ over $F$. □
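
The isomorphism between $F[x]/(f)$ and $F(\alpha)$ says that evaluating a polynomial $g$ at $\alpha$ gives the same answer as first reducing $g$ modulo $f$. The sketch below tests this for $F = \mathbb{Q}$, $f = x^3 - 2$ and $\alpha$ the real cube root of 2; the polynomial `g`, the helper names, and the use of a floating-point approximation to $\alpha$ are all assumptions made for this illustration.

```python
from fractions import Fraction


def remainder(g, f):
    """Remainder of g on division by f over Q (lists, constant term first)."""
    r = [Fraction(c) for c in g]
    d = len(f) - 1
    lead_f = Fraction(f[-1])
    while len(r) > d:
        factor = r.pop() / lead_f     # coefficient of the term being removed
        shift = len(r) - d
        for i in range(d):
            r[shift + i] -= factor * f[i]
    return r


def evaluate(g, value):
    """Horner evaluation of g at a floating-point value."""
    total = 0.0
    for c in reversed(g):
        total = total * value + float(c)
    return total


f = [-2, 0, 0, 1]              # x^3 - 2
alpha = 2 ** (1 / 3)           # floating-point stand-in for the cube root of 2
g = [5, -1, 0, 4, 3, 0, 1]     # 5 - x + 4x^3 + 3x^4 + x^6, an arbitrary example
r = remainder(g, f)            # coordinates of g(alpha) in the basis {1, alpha, alpha^2}
gap = abs(evaluate(g, alpha) - evaluate(r, alpha))
```

Here `r` is `[17, 5, 0]`, so $g(\alpha) = 17 + 5\alpha$, and `gap` is at the level of rounding error.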

Notice from the proof of the first part of Theorem D that the polynomial $f$ has the smallest degree of any polynomial having $\alpha$ as a root. For this reason it is called the minimum polynomial of $\alpha$ over $F$.

(8.18) An important property of the minimum polynomial is that it divides any other $F$-polynomial that has $\alpha$ as a root. Suppose that $g$ is such an $F$-polynomial. By unique factorisation in $F[x]$, we can decompose $g$ as

$$g = \lambda f_1 f_2 \cdots f_k,$$

where the $f_i$ are monic and irreducible over $F$. Being a root of $g$, the element $\alpha$ must be a root of one of the $f_i$. By uniqueness, this $f_i$ must be the minimum polynomial of $\alpha$ over $F$.

(8.19) It is labouring the point, but to find the degree of a simple extension $F \subset F(\alpha)$, you want to find
the degree of the minimum polynomial over $F$ of $\alpha$.

How do you find this polynomial? It's simple: guess! A sensible first guess is a polynomial with
$F$-coefficients that has $\alpha$ as a root. If your guess is also monic and irreducible, then you have guessed
right: Theorem D says there is only one such polynomial! If your guess is not monic, then replace it by
a suitable scalar multiple.

Thus, the only way you can go wrong is if you inadvertently guess a polynomial that is not irreducible. 
In this case, your next guess should be a factor of your first guess. In this way, the search for minimum 
polynomials is no harder than determining irreducibility. 
(8.20) As an example of this process, consider the minimum polynomial over $\mathbb{Q}$ of the $p$-th root of 1,

$$\cos\frac{2\pi}{p} + i\sin\frac{2\pi}{p},$$

for $p$ a prime. Your first guess is $x^p - 1$, which satisfies all the criteria bar irreducibility as $x - 1$ is a factor.
Factorising,

$$x^p - 1 = (x - 1)\Phi_p(x),$$
for $\Phi_p$ the $p$-th cyclotomic polynomial shown to be irreducible over $\mathbb{Q}$ in Exercise 29.
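
As a small illustration of the factorisation step, the following sketch divides $x^p - 1$ by $x - 1$ using synthetic division. The function names are hypothetical and chosen only for this example. The quotient has every coefficient equal to 1, that is, $\Phi_p(x) = x^{p-1} + \cdots + x + 1$.

```python
def divide_by_x_minus_1(coeffs):
    """Synthetic division of an integer polynomial by (x - 1).

    coeffs lists the coefficients from the highest degree down.
    Returns (quotient_coeffs, remainder).
    """
    quotient = []
    carry = 0
    for c in coeffs[:-1]:
        carry += c
        quotient.append(carry)
    remainder = carry + coeffs[-1]
    return quotient, remainder

def cyclotomic_prime(p):
    """Coefficients of (x^p - 1)/(x - 1) for a prime p, highest degree first."""
    x_p_minus_1 = [1] + [0] * (p - 1) + [-1]
    quotient, remainder = divide_by_x_minus_1(x_p_minus_1)
    assert remainder == 0  # x = 1 is a root of x^p - 1
    return quotient

print(cyclotomic_prime(5))
```

The code only performs the division; irreducibility of the quotient is the content of Exercise 29 and is not checked here.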

(8.21) How does one find the degree of extensions $F \subset F(\alpha_1, \ldots, \alpha_k)$ that are not simple, but the result
of adjoining several elements? Such extensions are just a sequence of simple extensions, one after the
other. If we can find the degrees of each of these simple extensions, all we need is a way to patch the
answers together. The result that does this is called the,

Tower Law. Let $F \subset E \subset L$ be a sequence or "tower" of extensions. If both of the intermediate
extensions $F \subset E$ and $E \subset L$ are finite, then $F \subset L$ is too, and indeed

$$[L : F] = [L : E][E : F].$$

(8.22) Before tackling the proof of the tower law, consider the example of the extension $\mathbb{Q} \subset \mathbb{Q}(\sqrt[3]{2}, i)$,
which is nothing other than a sequence of two simple extensions,

$$\mathbb{Q} \subset \mathbb{Q}(\sqrt[3]{2}) \subset \mathbb{Q}(\sqrt[3]{2}, i).$$

We can use Theorem D to find the degrees of each of these individual simple extensions. Firstly, the
minimum polynomial over $\mathbb{Q}$ of $\sqrt[3]{2}$ must be $x^3 - 2$, for this polynomial is monic in $\mathbb{Q}[x]$ with $\sqrt[3]{2}$ as a root
and irreducible over $\mathbb{Q}$ by Eisenstein (using $p = 2$). Thus the first of the two extensions above has degree
$\deg(x^3 - 2) = 3$ and $\{1, \sqrt[3]{2}, (\sqrt[3]{2})^2\}$ is a basis for $\mathbb{Q}(\sqrt[3]{2})$ over $\mathbb{Q}$.

Now let $\mathbb{F} = \mathbb{Q}(\sqrt[3]{2})$ so that the second extension is $\mathbb{F} \subset \mathbb{F}(i)$, where the minimum polynomial of
$i$ over $\mathbb{F}$ must be $x^2 + 1$: it is monic in $\mathbb{F}[x]$ with $i$ as a root, and irreducible over $\mathbb{F}$ as its two roots $\pm i$ are
not in $\mathbb{F}$ (as $\mathbb{F} \subset \mathbb{R}$). Thus Theorem D again gives that $\mathbb{F} \subset \mathbb{F}(i)$ is a degree $\deg(x^2 + 1) = 2$ extension
with $\{1, i\}$ a basis for $\mathbb{F}(i)$ over $\mathbb{F}$.
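
The appeal to Eisenstein for $x^3 - 2$ at $p = 2$ can be spelled out as a mechanical check. This is a minimal sketch, with `eisenstein_applies` a hypothetical helper name. The criterion requires that $p$ does not divide the leading coefficient, divides all the others, and that $p^2$ does not divide the constant term.

```python
def eisenstein_applies(coeffs, p):
    """Check Eisenstein's criterion at the prime p.

    coeffs lists integer coefficients from the highest degree down.
    True means the criterion applies, so the polynomial is irreducible
    over Q. False only means the test is inconclusive at this p.
    """
    leading, *middle, constant = coeffs
    return (
        leading % p != 0
        and all(c % p == 0 for c in middle)
        and constant % p == 0
        and constant % (p * p) != 0
    )

print(eisenstein_applies([1, 0, 0, -2], 2))  # x^3 - 2 at p = 2
print(eisenstein_applies([1, 0, 1], 2))      # x^2 + 1 at p = 2
```

The second call returns `False`, which is why the irreducibility of $x^2 + 1$ over $\mathbb{Q}(\sqrt[3]{2})$ is argued from its roots instead.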

Now consider the elements,

$$\{1,\ \sqrt[3]{2},\ (\sqrt[3]{2})^2,\ i,\ \sqrt[3]{2}\,i,\ (\sqrt[3]{2})^2 i\},$$

obtained by multiplying the two bases together. The claim is that they form a basis for $\mathbb{Q}(\sqrt[3]{2}, i) = \mathbb{F}(i)$
over $\mathbb{Q}$, so we need to show that the $\mathbb{Q}$-span of these six is all of this field and that they are linearly
independent over $\mathbb{Q}$. For the first, let $x$ be an arbitrary element of $\mathbb{Q}(\sqrt[3]{2}, i) = \mathbb{F}(i)$. As $\{1, i\}$ is a basis
for $\mathbb{F}(i)$ over $\mathbb{F}$, $x$ can be expressed as an $\mathbb{F}$-linear combination,

$$x = a + bi, \quad a, b \in \mathbb{F}.$$

As $\{1, \sqrt[3]{2}, (\sqrt[3]{2})^2\}$ is a basis for $\mathbb{F}$ over $\mathbb{Q}$, both $a$ and $b$ can be expressed as $\mathbb{Q}$-linear combinations,

$$a = a_0 + a_1\sqrt[3]{2} + a_2(\sqrt[3]{2})^2, \quad b = b_0 + b_1\sqrt[3]{2} + b_2(\sqrt[3]{2})^2,$$

with the $a_i, b_i \in \mathbb{Q}$. This gives,

$$x = a_0 + a_1\sqrt[3]{2} + a_2(\sqrt[3]{2})^2 + b_0 i + b_1\sqrt[3]{2}\,i + b_2(\sqrt[3]{2})^2 i,$$

a $\mathbb{Q}$-linear combination for $x$, and so these six elements do indeed span the $\mathbb{Q}$-vector space $\mathbb{Q}(\sqrt[3]{2}, i)$.
Suppose we have,

$$a_0 + a_1\sqrt[3]{2} + a_2(\sqrt[3]{2})^2 + b_0 i + b_1\sqrt[3]{2}\,i + b_2(\sqrt[3]{2})^2 i = 0,$$
with the $a_i, b_i \in \mathbb{Q}$. Gathering together real and imaginary parts,

$$(a_0 + a_1\sqrt[3]{2} + a_2(\sqrt[3]{2})^2) + (b_0 + b_1\sqrt[3]{2} + b_2(\sqrt[3]{2})^2)i = a + bi = 0,$$

for $a$ and $b$ now elements of $\mathbb{F}$. As $\{1, i\}$ are independent over $\mathbb{F}$ we must have that the coefficients in this
last expression are zero, ie: that $a = b = 0$. This now gives,

$$a_0 + a_1\sqrt[3]{2} + a_2(\sqrt[3]{2})^2 = 0 = b_0 + b_1\sqrt[3]{2} + b_2(\sqrt[3]{2})^2,$$

and as $\{1, \sqrt[3]{2}, (\sqrt[3]{2})^2\}$ are independent over $\mathbb{Q}$ we deduce that all the coefficients in these two expressions
are zero, ie: that $a_0 = a_1 = a_2 = b_0 = b_1 = b_2 = 0$, so that the six elements are independent and form a
basis as claimed.

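This certainly agrees with the answer given to us by the tower law in that,

$$6 = [\mathbb{Q}(\sqrt[3]{2}, i) : \mathbb{Q}] = [\mathbb{Q}(\sqrt[3]{2}, i) : \mathbb{Q}(\sqrt[3]{2})]\,[\mathbb{Q}(\sqrt[3]{2}) : \mathbb{Q}] = 2 \times 3.$$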

(8.23) The example above is more than a specific verification of the tower law. It also shows us exactly
how to prove it:

Proof: Let $\{\alpha_1, \alpha_2, \ldots, \alpha_n\}$ be a basis for $E$ as an $F$-vector space and $\{\beta_1, \beta_2, \ldots, \beta_m\}$ a basis for
$L$ as an $E$-vector space, both containing a finite number of elements as these extensions are finite by
assumption. We then show that the $mn$ elements

$$\{\alpha_j\beta_i\}, \quad 1 \le j \le n,\ 1 \le i \le m,$$

form a basis for the $F$-vector space $L$. Working "backwards" as in the example above, if $x$ is any element
of $L$ we can express it as an $E$-linear combination of the $\{\beta_1, \ldots, \beta_m\}$,

$$x = \sum_{i=1}^{m} a_i\beta_i,$$

where, as they are elements of $E$, each of the $a_i$ can be expressed as $F$-linear combinations of the
$\{\alpha_1, \alpha_2, \ldots, \alpha_n\}$,

$$a_i = \sum_{j=1}^{n} b_{ij}\alpha_j \quad\Rightarrow\quad x = \sum_{i=1}^{m}\sum_{j=1}^{n} b_{ij}\alpha_j\beta_i.$$

Thus the elements span the field $L$. If we have

$$\sum_{i=1}^{m}\sum_{j=1}^{n} b_{ij}\alpha_j\beta_i = 0,$$

with the $b_{ij} \in F$, we can collect together all the $\beta_1$ terms, all the $\beta_2$ terms, and so on (much as we took
real and imaginary parts in the example), to obtain an $E$-linear combination,

$$\Big(\sum_j b_{1j}\alpha_j\Big)\beta_1 + \Big(\sum_j b_{2j}\alpha_j\Big)\beta_2 + \cdots + \Big(\sum_j b_{mj}\alpha_j\Big)\beta_m = 0.$$

The independence of the $\beta_i$ over $E$ forces all the coefficients to be zero so that

$$\sum_j b_{1j}\alpha_j = \cdots = \sum_j b_{mj}\alpha_j = 0,$$

and the independence of the $\alpha_j$ over $F$ forces all the coefficients in each of these to be zero too, ie:
$b_{ij} = 0$ for all $i, j$. □
Further Exercises for §8.

Exercise 74 Show that the following two fields are isomorphic:

$$\mathbb{Q}\Big(\cos\frac{2\pi}{p} + i\sin\frac{2\pi}{p}\Big) \quad\text{and}\quad \mathbb{Q}\Big(\cos\frac{4\pi}{p} + i\sin\frac{4\pi}{p}\Big),$$

where $p$ is an (odd) prime number.

Exercise 75

1. Show that if $F \subset L$ are fields with $[L : F] = 1$ then $L = F$.

2. Let $F \subset L \subset E$ be fields with $[E : F] = [L : F]$. Show that $E = L$.

Exercise 76 Let $F = \mathbb{Q}(\alpha)$, where $\alpha^3 = 2$. Express $(1 + \alpha)^{-1}$ and $(\alpha^4 + 1)(\alpha^2 + 1)^{-1}$ in the form $b\alpha^2 + c\alpha + d$, where $b, c, d$
are in $\mathbb{Q}$.

Exercise 77 Let $\alpha = \sqrt[3]{5}$. Express the following elements of $\mathbb{Q}(\alpha)$ as polynomials of degree at most 2 in $\alpha$ (with coefficients in
$\mathbb{Q}$):

1. $1/\alpha$.

2. $\alpha^5 - \alpha^6$.

3. $\alpha/(\alpha^2 + 1)$.

Exercise 78 Find the minimum polynomial over Q of a = \/2 + \/—2. Show that the following are elements of the field Q(a) 
and express them as polynomials in a (with coefficients in Q) of degree at most 3: 

1. y/2. 

2. ^2. 

3. i. 

4. a 5 +4a + 3. 

5. 1/a. 

6. (2a + 3)/(a 2 + 2a + 2). 

Exercise 79 Find the minimum polynomials over Q of the following numbers: 

1. 1 + 

2. ^7. 

3. ^5. 

4. y/2 + i. 

5. V2+ ^3. 

Exercise 80 Find the minimum polynomial over Q of the following: 

1. V7. 

2. (^ + 3)72. 

3. (i-v/3- l)/2. 

Exercise 81 For each of the following fields L and F, find [L : F] and compute a basis for L over F. 

1. L = Q(V2, ^2), F = Q; 

2. L = Q(^2,i),F = Q(i); 

3. L = Q(g). F = Q, where ^ is a primitive complex 7th root of unity; 

4. L = Q(i, V3, w), F = Q, where w is a primitive complex cube root of unity. 

Exercise 82 Let a = e"/ 4 . Find [F(a) : F] when F = R and when F = Q. 
§9. Fields III: A Menagerie

This section collects together a number of miscellaneous but important concepts and examples of fields.

Splitting Fields

(9.1) In the first lecture we were interested in fields containing just enough numbers to solve some polynomial
equation.

Suppose $f$ is a polynomial with $F$ coefficients. We say that $f$ splits in an extension $F \subset E$ iff we can
factorise

$$f = \prod_{i=1}^{\deg f} (x - \alpha_i),$$

in the polynomial ring $E[x]$. Thus $f$ splits in $E$ precisely when $E$ contains all the roots $\{\alpha_1, \alpha_2, \ldots, \alpha_{\deg f}\}$
of $f$.

There will in general be many such extension fields: we are after the smallest one. Call $E$ a splitting
field for $f$ over $F$, if $f$ splits in $E$ and $E = F(\alpha_1, \alpha_2, \ldots, \alpha_{\deg f})$, where $\{\alpha_1, \alpha_2, \ldots, \alpha_{\deg f}\}$ are the
roots of $f$.

Exercise 83 Show that $E$ is a splitting field of the polynomial $f$ over $F$ if and only if $f$ splits in $E$ but not in any subfield of $E$
containing $F$ (so in this sense, $E$ is the smallest field containing $F$ and all the roots).

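(9.2) Our example from the first lecture again: the polynomial was $x^3 - 2$ with roots $\alpha, \alpha\omega, \alpha\omega^2$ where
$\alpha = \sqrt[3]{2} \in \mathbb{R}$ and

$$\omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}\,i.$$

Thus a splitting field for $f$ over $\mathbb{Q}$ is given by $\mathbb{Q}(\alpha, \alpha\omega, \alpha\omega^2)$, which is the same thing as $\mathbb{Q}(\alpha, \omega)$.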

(9.3) The example above shows that we can always find a splitting field for a polynomial over $F$: by
Kronecker's Theorem we can find an extension $E$ of $F$ that contains all the roots $\{\alpha_1, \alpha_2, \ldots, \alpha_{\deg f}\}$ of
$f$, and so adjoining them to $F$ gives a splitting field $F(\alpha_1, \alpha_2, \ldots, \alpha_{\deg f}) \subset E$.

Aside. In §12. we will prove (Theorem ??) that an isomorphism of a field to itself $\sigma : F \to F$ can always be extended to an
isomorphism $\hat{\sigma} : E_1 \to E_2$ where $E_1$ is a splitting field of some polynomial $f$ over $F$ and $E_2$ is another splitting field of this
polynomial. Thus, any two splitting fields of a polynomial over $F$ are isomorphic.

Finite Fields 

We have already met a number of examples of finite fields: the $\mathbb{F}_p$ of course, and a few others such as the
field of order 8 in §4.

(9.4) To get more examples, we saw in §6. that by taking irreducible polynomials over finite fields we
could, in principle, construct fields with a prime power number of elements. The idea was to find a
polynomial of degree $n$, irreducible over the field $\mathbb{F}_p$, giving a field of order $p^n$. Here is a very concrete
example of that idea.

Consider the polynomial $f = x^2 + x + 2 \in \mathbb{F}_3[x]$. Substituting the three elements of $\mathbb{F}_3$ into $f$ gives

$$0^2 + 0 + 2 = 2, \quad 1^2 + 1 + 2 = 1 \quad\text{and}\quad 2^2 + 2 + 2 = 2,$$

so that $f$ has no roots in $\mathbb{F}_3$. Being a quadratic, this gives that $f$ is irreducible over the field $\mathbb{F}_3$, and so
$\mathbb{F}_3[x]/\langle x^2 + x + 2 \rangle$ is a field of order $3^2$ called, say, $\mathbb{F}_9$.

Let $\alpha = x + \langle x^2 + x + 2 \rangle$ in $\mathbb{F}_9$ be a generator for this field as in §6., so that the elements of $\mathbb{F}_9$ have
the form $a + b\alpha$ with $a, b \in \mathbb{F}_3$ and multiplication satisfying the rule $\alpha^2 + \alpha + 2 = 0$, or equivalently
$\alpha^2 = 2\alpha + 1$ (note that $-1 = 2$ and $-2 = 1$ in $\mathbb{F}_3$). Now let $X$ be a new indeterminate, and consider the polynomials $\mathbb{F}_9[X]$ over $\mathbb{F}_9$ in this
new variable. In particular,

$$g = X^3 + (2\alpha + 1)X + 1.$$

As $g$ is a cubic, it will be irreducible over $\mathbb{F}_9$ precisely when it has no roots in this field, which can be
verified as usual by a straight, albeit tedious, substitution:

$g(0) = 1$,

$g(1) = 1^3 + 2\alpha + 1 + 1 = 2\alpha$,

$g(2) = 2^3 + 2(2\alpha + 1) + 1 = \alpha + 2$,

$g(\alpha) = \alpha^3 + \alpha(2\alpha + 1) + 1 = \alpha(2\alpha + 1) + \alpha(2\alpha + 1) + 1 = 4\alpha^2 + 2\alpha + 1 = 2\alpha + 1 + 2\alpha + 1 = \alpha + 2$,

$g(\alpha + 1) = (\alpha + 1)^3 + (\alpha + 1)(2\alpha + 1) + 1 = \alpha^3 + 1 + 2\alpha^2 + 1 + 1 = \alpha(2\alpha + 1) + 2(2\alpha + 1) = (\alpha + 2)(2\alpha + 1) = 2(\alpha^2 + \alpha + 1) = 1$,

$g(\alpha + 2) = (\alpha + 2)^3 + (\alpha + 2)(2\alpha + 1) + 1 = \alpha^3 + 2 + (\alpha + 2)(2\alpha + 1) + 1 = (2\alpha + 2)(2\alpha + 1) = \alpha^2 + 2 = (2\alpha + 1) + 2 = 2\alpha$,

$g(2\alpha) = (2\alpha)^3 + 2\alpha(2\alpha + 1) + 1 = 2\alpha(2\alpha + 1) + 2\alpha(2\alpha + 1) + 1 = \alpha(2\alpha + 1) + 1 = 2\alpha^2 + \alpha + 1 = \alpha + 2 + \alpha + 1 = 2\alpha$,

$g(2\alpha + 1) = (2\alpha + 1)^3 + (2\alpha + 1)(2\alpha + 1) + 1 = 2\alpha^3 + 1 + \alpha^2 + \alpha + 1 + 1 = 2\alpha(2\alpha + 1) + \alpha^2 + \alpha = \alpha^2 + 2\alpha + \alpha^2 + \alpha = \alpha + 2$,

$g(2\alpha + 2) = (2\alpha + 2)^3 + (2\alpha + 2)(2\alpha + 1) + 1 = 2\alpha^3 + 2 + \alpha^2 + 2 + 1 = 2\alpha(2\alpha + 1) + \alpha^2 + 2 = 2(\alpha^2 + \alpha + 1) = 1$.
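
The nine substitutions can also be checked mechanically. The sketch below is an illustration only: it encodes an element $a + b\alpha$ of $\mathbb{F}_9$ as the pair `(a, b)`, an encoding assumed for this example, and multiplies using the rule $\alpha^2 = 2\alpha + 1$.

```python
# Elements of F_9 are pairs (a, b) standing for a + b*alpha with a, b in F_3,
# where alpha^2 = 2*alpha + 1.

def add(u, v):
    return ((u[0] + v[0]) % 3, (u[1] + v[1]) % 3)

def mul(u, v):
    a, b = u
    c, d = v
    # (a + b*alpha)(c + d*alpha) = ac + (ad + bc)*alpha + bd*alpha^2,
    # and alpha^2 = 1 + 2*alpha.
    return ((a * c + b * d) % 3, (a * d + b * c + 2 * b * d) % 3)

def g(x):
    """Evaluate g(X) = X^3 + (2*alpha + 1)*X + 1 at x in F_9."""
    cube = mul(x, mul(x, x))
    linear = mul((1, 2), x)  # (1, 2) encodes 1 + 2*alpha
    return add(add(cube, linear), (1, 0))

F9 = [(a, b) for b in range(3) for a in range(3)]
values = {x: g(x) for x in F9}
has_root = any(v == (0, 0) for v in values.values())
print(values)
print("g has a root in F_9:", has_root)
```

None of the nine values is `(0, 0)`, so $g$ has no root in $\mathbb{F}_9$.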

We have used an energy-saving device in these computations, as summarised in the following exercise.

Exercise 84 If $F$ is a field of characteristic $p > 0$, then $(a + b)^p = a^p + b^p$ (hint: refer to Exercise 27).

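Thus $g$ is irreducible over $\mathbb{F}_9$, giving a field

$$\mathbb{F}_9[X]/\langle X^3 + (2\alpha + 1)X + 1 \rangle$$

of order $9^3 = 3^6 = 729$, called say $\mathbb{F}_{729}$. As we have a sequence of extensions $\mathbb{F}_3 \subset \mathbb{F}_9 \subset \mathbb{F}_{729}$, we can
view $\mathbb{F}_{729}$ in two ways. Using the extension $\mathbb{F}_9 \subset \mathbb{F}_{729}$, the elements have the form,

$$A_0 + A_1\beta + A_2\beta^2,$$

where the $A_i \in \mathbb{F}_9$ and $\beta = X + \langle g \rangle$. Multiplication uses the rule $\beta^3 = (\alpha + 2)\beta + 2$. Alternatively, the
extension $\mathbb{F}_3 \subset \mathbb{F}_{729}$ has, by the tower law, elements of the form,

$$a_0 + a_1\beta + a_2\beta^2 + a_3\alpha + a_4\alpha\beta + a_5\alpha\beta^2,$$

with the $a_i \in \mathbb{F}_3$.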
Exercise 85

1. Construct a field $\mathbb{F}_8$ with 8 elements by showing that $x^3 + x + 1$ is irreducible over $\mathbb{F}_2$.

2. Find a cubic polynomial that is irreducible in $\mathbb{F}_8[x]$ (hint: refer to Exercise 26).

3. Hence, or otherwise, construct a field with $2^9 = 512$ elements.
(9.5) Recall that the prime subfield of a field is the smallest subfield, and is isomorphic to $\mathbb{F}_p$ for some $p$
or to $\mathbb{Q}$. In particular, the prime subfield of a finite field $F$ must be isomorphic to $\mathbb{F}_p$.

Using the ideas from §8., we have an extension of fields $\mathbb{F}_p \subset F$ and hence the finite field $F$ forms a
vector space over the field $\mathbb{F}_p$. This space must be finite dimensional (for $F$ to be finite), so each element
of $F$ can be written uniquely as a linear combination,

$$a_1\alpha_1 + a_2\alpha_2 + \cdots + a_n\alpha_n,$$

of some basis vectors $\alpha_1, \alpha_2, \ldots, \alpha_n$ with the $a_i \in \mathbb{F}_p$. In particular there are $p$ choices for each $a_i$, and
the choices are independent, giving $p^n$ elements of $F$ in total.
Thus, a finite field must have $p^n$ elements for some prime $p$.

(9.6) Here is an extended example that shows the converse, ie: gives a standard construction of a field
with $q = p^n$ elements for any prime $p$ and positive integer $n$. Consider first the polynomial $x^q - x$ over
the field $\mathbb{F}_p$ of $p$ elements.

Let $L$ be an extension of the field $\mathbb{F}_p$ containing all the roots of the polynomial, as guaranteed us by the
Corollary to Kronecker's Theorem. In Exercise 16 we used the formal derivative to see whether a polynomial
has distinct roots. We have that $\partial(x^q - x) = qx^{q-1} - 1 = p^n x^{p^n - 1} - 1 = -1$ as $p^n = 0$ in $\mathbb{F}_p$. Clearly
the constant polynomial $-1$ has no roots in $L$, and so the original polynomial $x^q - x$ has no repeated
roots in $L$ by Exercise 16.

In fact, the $p^n$ distinct roots of $x^q - x$ in $L$ form a subfield, and this is the field of order $p^n$ that we are
after. To show this, we need that if $\lambda, \mu$ are roots, then so are $-\lambda$, $\lambda + \mu$, $\lambda\mu$ and $\lambda^{-1}$.

Firstly, $(-\lambda)^q - (-\lambda) = (-1)^q\lambda^q + \lambda$. As $p$ is a prime, it is either $2$ or odd, so we
have two cases to consider. If $p = 2$, then $-1 = 1$ in $\mathbb{F}_2$, so that $(-1)^q\lambda^q + \lambda = \lambda^q + \lambda = \lambda + \lambda$ (as $\lambda$ is
a root of $x^q - x$ so that $\lambda^q = \lambda$) $= 2\lambda = 0$. If $p$ is odd then $(-1)^q = -1$ and $(-1)^q\lambda^q + \lambda = -\lambda^q + \lambda =
-\lambda + \lambda = 0$. In either case, $-\lambda$ is also a root of the polynomial $x^q - x$.

Next,

$$(\lambda + \mu)^q = \sum_{i=0}^{q} \binom{q}{i}\lambda^i\mu^{q-i} = \lambda^q + \mu^q + p\,(\text{other terms}),$$

where the other terms vanish because $p = 0$ in $\mathbb{F}_p$. Thus $(\lambda + \mu)^q = \lambda^q + \mu^q$. Substituting $\lambda + \mu$ into our polynomial
then gives

$$(\lambda + \mu)^q - (\lambda + \mu) = \lambda^q + \mu^q - \lambda - \mu = 0,$$

as both $\lambda$ and $\mu$ are roots so that $\lambda^q - \lambda = 0 = \mu^q - \mu$. Thus $\lambda + \mu$ is also a root of the polynomial.

Now, $(\lambda\mu)^q - \lambda\mu = \lambda^q\mu^q - \lambda\mu = \lambda\mu - \lambda\mu = 0$. Finally, for $\lambda \neq 0$, $(\lambda^{-1})^q - \lambda^{-1} = (\lambda^q)^{-1} - \lambda^{-1} =
\lambda^{-1} - \lambda^{-1} = 0$. In both cases we have used $\lambda^q = \lambda$.

Thus the $q = p^n$ roots of the polynomial in $L$ form a subfield as claimed, and we have constructed a
field with this many elements.
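
The step that discards the "other terms" rests on every binomial coefficient $\binom{q}{i}$ with $0 < i < q$ being divisible by $p$ when $q = p^n$. As a quick sanity check for a few small cases (not a proof), here is a sketch using the standard library's `math.comb`; the function name `middle_binomials_vanish` is made up for this illustration.

```python
from math import comb

def middle_binomials_vanish(p, n):
    """Check that C(q, i) is divisible by p for 0 < i < q, where q = p**n."""
    q = p ** n
    return all(comb(q, i) % p == 0 for i in range(1, q))

for p, n in [(2, 3), (3, 2), (5, 2), (7, 1)]:
    print(p, n, middle_binomials_vanish(p, n))
```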

(9.7) Looking back at this example, we let $L$ be an extension of $\mathbb{F}_p$ containing all the roots of the polynomial
$x^q - x$. In particular, if these roots are $\{\alpha_1, \ldots, \alpha_q\}$, then $\mathbb{F}_p(\alpha_1, \ldots, \alpha_q)$ is a, hence the, splitting
field over $\mathbb{F}_p$ of the polynomial. In the example we constructed the subfield $F$ of $L$ consisting of the roots
of $x^q - x$. As any subfield contains $\mathbb{F}_p$, we have $\mathbb{F}_p(\alpha_1, \ldots, \alpha_q) \subset F$, whereas $F = \{\alpha_1, \ldots, \alpha_q\}$ so that
$F \subset \mathbb{F}_p(\alpha_1, \ldots, \alpha_q)$. Hence the field we constructed in the example was the splitting field over $\mathbb{F}_p$ of the
polynomial $x^q - x$.

If $F$ is now an arbitrary field with $q$ elements, then it has prime subfield $\mathbb{F}_p$. Moreover, as the multiplicative
group of $F$ has order $q - 1$, by Lagrange's Theorem (see §11.), every non-zero element of $F$ satisfies
$x^{q-1} = 1$, hence every element of $F$ is a root of the $\mathbb{F}_p$-polynomial $x^q - x = x(x^{q-1} - 1)$. Thus, a finite field of order $q$ is
the splitting field over $\mathbb{F}_p$ of the polynomial $x^q - x$, and by the uniqueness of such things, any two fields
of order $q$ are isomorphic.

(9.8) We finish with a fact about finite fields that will prove useful later on. Remember that a field
is, among other things, two groups spliced together in a compatible way: the elements form a group
under addition (the additive group) and the non-zero elements form a group under multiplication (the
multiplicative group).

Looking at the complex numbers as an example, we can find a number of finite subgroups of the
multiplicative group $\mathbb{C}^*$ of $\mathbb{C}$ by considering roots of 1. For any $n$, the powers of the $n$-th root of 1,

$$\omega = \cos\frac{2\pi}{n} + i\sin\frac{2\pi}{n},$$

form a subgroup of $\mathbb{C}^*$ of order $n$. Indeed, by definition, this subgroup is cyclic.

Proposition 1 Let F be any field and G a finite subgroup of the multiplicative group F* of F. Then G is 
a cyclic group. 

In particular, if F is now a finite field, then the whole multiplicative group F* of F is finite. Hence the 
multiplicative group of a finite field is cyclic. 

Proof: By Exercise 97 the order in an Abelian group of the element $gh$ is the lowest common multiple
of the orders of $g$ and $h$. As $G$ is finite we can write a list $1 = m_1, m_2, \ldots, m_k$ of all the possible orders
of elements and find $1 = g_1, g_2, \ldots, g_k$ such that $g_i$ has order $m_i$. Thus $g_1 g_2 \cdots g_k$ has order the lowest
common multiple of all the possible orders in the group. Thus, if we call this order $m$, there is an element
$g$ of the group of order $m$, and any other element $h$ satisfies $h^m = 1$. Hence every element of the group
is a root of $x^m - 1$, and since this polynomial has at most $m$ roots in $F$, the order of $G$ must be $\le m$. As
$g \in G$ has order $m$ its powers must exhaust the whole group, hence $G$ is cyclic. □
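
For the prime fields $\mathbb{F}_p$ the proposition can be observed directly by brute force. The following sketch (with hypothetical helper names) lists the elements of $\mathbb{F}_p^*$ whose order is the full $p - 1$, that is, the generators of the cyclic group.

```python
def multiplicative_order(g, p):
    """Order of g in the multiplicative group of F_p (g not divisible by p)."""
    order, power = 1, g % p
    while power != 1:
        power = (power * g) % p
        order += 1
    return order

def generators(p):
    """All generators of the cyclic group F_p^* for a prime p."""
    return [g for g in range(1, p) if multiplicative_order(g, p) == p - 1]

print(generators(7))
print(generators(17))
```

The lists are never empty for a prime $p$, which is exactly the statement that $\mathbb{F}_p^*$ is cyclic.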

Algebraically closed fields 

(9.9) In the first part of this section we dealt with fields in which a particular polynomial of interest split
into linear factors. On the other hand, there are fields like the complex numbers in which any polynomial
splits.

A field $F$ is said to be algebraically closed if and only if every (non-constant) polynomial over $F$ splits
in $F$.

(9.10) If $F$ is algebraically closed and $\alpha$ is algebraic over $F$ then there is a polynomial with $F$-coefficients
having $\alpha$ as a root. As $F$ is algebraically closed, this polynomial splits in $F$, so that in particular $\alpha$ is
in $F$. This explains the terminology: an algebraically closed field is closed with respect to the taking of
algebraic elements. Contrast this with fields like $\mathbb{Q}$, over which there are algebraic elements like $\sqrt{2}$ that
are not contained in $\mathbb{Q}$.

Exercise 86 Show that the following are equivalent: 

1 . F is algebraically closed; 

2. every non-constant polynomial over F has a root in F; 

3. the irreducible polynomials over F are precisely the linear ones; 

4. if F C E is a finite extension then E = F. 

Theorem 6 Every field is contained in an algebraically closed one. 

Proof (sketch): The full proof is beyond the scope of these notes, although the technical difficulties are 
not algebraic or number theoretical, but set theoretical. If the field is countable, the proof sort of runs as 
follows: there are countably many polynomials over a countable field, so take the union of all the splitting 
fields of these polynomials. Note that for a finite field, this is an infinite union, so an algebraically closed 
field containing even a finite field is very large. □ 
Simple extensions

(9.11) We saw in §4. that an extension like $\mathbb{Q} \subset \mathbb{Q}(\sqrt{2}, \sqrt{3})$ is, despite appearances, simple. It is certainly
a finite extension, and this turns out to give simplicity as we now show:

Theorem 7 Let $F \subset E$ be a finite extension so that the roots of any irreducible polynomial $f \in E[x]$ are
distinct. Then $E$ is simple, ie: $E = F(\alpha)$ for some $\alpha \in E$.

The following proof is for the case that F is infinite. 

Proof: Let $\{\alpha_1, \alpha_2, \ldots, \alpha_k\}$ be a basis for $E$ over $F$ and consider the field $F_1 = F(\alpha_3, \ldots, \alpha_k)$, so that
$E = F_1(\alpha_1, \alpha_2)$. We will show that $F_1(\alpha_1, \alpha_2)$ is a simple extension of $F_1$, ie: that $F_1(\alpha_1, \alpha_2) = F_1(\theta)$
for some $\theta \in E$. Thus $E = F(\alpha_1, \alpha_2, \ldots, \alpha_k) = F(\theta, \alpha_3, \ldots, \alpha_k)$, and so by repeatedly applying this
procedure, $E$ is a simple extension.

Let $f_1, f_2$ be the minimum polynomials over $F_1$ of $\alpha_1$ and $\alpha_2$, and let $L$ be an algebraically closed field
containing the field $F$. As the $\alpha_i$ are algebraic over $F$, we have that the fields $F_1$ and $E$ are contained
in $L$ too. In particular the polynomials $f_1$ and $f_2$ split in $L$,

$$f_1 = \prod_{i=1}^{\deg f_1} (x - \beta_i), \quad f_2 = \prod_{i=1}^{\deg f_2} (x - \delta_i),$$

with $\beta_1 = \alpha_1$ and $\delta_1 = \alpha_2$. As the roots of these polynomials are distinct we have that $\beta_i \neq \beta_j$ and
$\delta_i \neq \delta_j$ for all $i \neq j$. For any $i$ and any $j \neq 1$, the equation $\beta_i + x\delta_j = \beta_1 + x\delta_1$ has at most one
solution in $F_1$, namely

$$x = \frac{\beta_i - \beta_1}{\delta_1 - \delta_j},$$

(notice that if we had $\delta_j = \delta_1$ then there could be infinitely many solutions to the equation $\beta_i + x\delta_j =
\beta_1 + x\delta_1$). As there are only finitely many such equations and infinitely many elements of $F_1$, there must be
a $c \in F_1$ which is a solution to none of them, ie: such that,

$$\beta_i + c\delta_j \neq \beta_1 + c\delta_1$$

for any $i$ and any $j \neq 1$. Let $\theta = \beta_1 + c\delta_1 = \alpha_1 + c\alpha_2$, and we show that $F_1(\alpha_1, \alpha_2) = F_1(\theta) =
F_1(\alpha_1 + c\alpha_2)$.

Clearly $\alpha_1 + c\alpha_2 \in F_1(\alpha_1, \alpha_2)$ so that $F_1(\alpha_1 + c\alpha_2) \subset F_1(\alpha_1, \alpha_2)$. We will show that $\alpha_2 \in F_1(\alpha_1 +
c\alpha_2) = F_1(\theta)$, for then if so, $\alpha_1 + c\alpha_2 - c\alpha_2 = \alpha_1 \in F_1(\alpha_1 + c\alpha_2)$, and so $F_1(\alpha_1, \alpha_2) \subset F_1(\alpha_1 + c\alpha_2)$.

We have $0 = f_1(\alpha_1) = f_1(\theta - c\alpha_2)$, so if we let $r(t) \in F_1(\theta)[t]$ be given by $r(t) = f_1(\theta - ct)$, then
we have that $\alpha_2$ is a root of both $r(t)$ and $f_2(t)$. If $\gamma$ is another common root of $r$ and $f_2$, then $\gamma$ is one
of the $\delta_j$ with $j \neq 1$, and $\theta - c\gamma$ (being a root of $f_1$) is one of the $\beta_i$, so that,

$$\gamma = \delta_j \ \text{and}\ \theta - c\gamma = \beta_i \quad\Rightarrow\quad \beta_i + c\delta_j = \beta_1 + c\delta_1,$$

a contradiction. Thus $r$ and $f_2$ have just the single common root $\alpha_2$. Let $h$ be the minimum polynomial
of $\alpha_2$ over $F_1(\theta)$, so that $h$ divides both $r$ and $f_2$ (recall that the minimum polynomial divides any other
polynomial having $\alpha_2$ as a root). This means that $h$ must have degree one, for a higher degree would give
more than one common root for $r$ and $f_2$, ie: $h = t + b$ for some $b \in F_1(\theta)$. As $h(\alpha_2) = 0$ we thus get
that $\alpha_2 = -b$ and so $\alpha_2 \in F_1(\theta)$ as required. □

The theorem is true for finite extensions of finite fields (even without the condition on the roots of
the polynomials), but we omit the proof here. We saw in Exercise 36 that irreducible polynomials over
fields of characteristic 0 have distinct roots. Thus, any finite extension of a field of characteristic zero is
simple. For example, if $\alpha_1, \ldots, \alpha_k$ are algebraic over $\mathbb{Q}$, then $\mathbb{Q}(\alpha_1, \ldots, \alpha_k) = \mathbb{Q}(\theta)$ for some $\theta$. This is
a fundamental fact in algebraic number theory, the proof of which we have merely adapted.
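
To see the theorem at work in a concrete case, take $\theta = \sqrt{2} + \sqrt{3}$ (the choice $c = 1$ in the proof; that this choice works here is an assumption of the sketch, confirmed by the computation itself). The code below does exact arithmetic in $\mathbb{Q}(\sqrt{2}, \sqrt{3})$, storing $a + b\sqrt{2} + c\sqrt{3} + d\sqrt{6}$ as the tuple `(a, b, c, d)`, an encoding invented for this example. It recovers $\sqrt{2}$ and $\sqrt{3}$ as polynomials in $\theta$.

```python
from fractions import Fraction

def mul(u, v):
    """Multiply a + b*sqrt2 + c*sqrt3 + d*sqrt6, stored as (a, b, c, d)."""
    a1, b1, c1, d1 = u
    a2, b2, c2, d2 = v
    return (
        a1 * a2 + 2 * b1 * b2 + 3 * c1 * c2 + 6 * d1 * d2,
        a1 * b2 + b1 * a2 + 3 * (c1 * d2 + d1 * c2),
        a1 * c2 + c1 * a2 + 2 * (b1 * d2 + d1 * b2),
        a1 * d2 + d1 * a2 + b1 * c2 + c1 * b2,
    )

def combine(*terms):
    """Rational linear combination: terms are (coefficient, element) pairs."""
    return tuple(sum(Fraction(k) * e[i] for k, e in terms) for i in range(4))

theta = (0, 1, 1, 0)  # sqrt2 + sqrt3
theta2 = mul(theta, theta)
theta3 = mul(theta2, theta)
theta4 = mul(theta3, theta)

sqrt2 = combine((Fraction(1, 2), theta3), (Fraction(-9, 2), theta))
sqrt3 = combine((Fraction(-1, 2), theta3), (Fraction(11, 2), theta))
min_poly_at_theta = combine((1, theta4), (-10, theta2), (1, (1, 0, 0, 0)))
print(sqrt2, sqrt3, min_poly_at_theta)
```

So $\sqrt{2} = (\theta^3 - 9\theta)/2$ and $\sqrt{3} = (11\theta - \theta^3)/2$ both lie in $\mathbb{Q}(\theta)$, and $\theta$ is a root of $x^4 - 10x^2 + 1$.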
§10. Ruler and Compass constructions II

(10.1) The degree of an extension is the only concept we need to completely answer the question of
which complex numbers are constructible:

Theorem E. The number $\zeta \in \mathbb{C}$ is constructible if and only if there exists a sequence of field extensions,

$$\mathbb{Q} = K_0 \subset K_1 \subset K_2 \subset \cdots \subset K_n,$$
such that $\mathbb{Q}(\zeta)$ is a subfield of $K_n$, and each $K_i$ is a degree two extension of $K_{i-1}$.



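Proof: ($\Rightarrow$) We prove the "only if" part first. Recall that $\zeta$ is constructible if and only if there is a
sequence of numbers

$$0, 1, i = \zeta_1, \zeta_2, \ldots, \zeta_n = \zeta,$$
with $\zeta_j$ obtained from earlier numbers in the sequence in one of the three forms,

[Figure: the three forms: (i) the intersection of two straight lines, (ii) the intersection of a straight line and a circle, (iii) the intersection of two circles]

with $p, q, r, s \in \{1, 2, \ldots, j - 1\}$. Let $K_j$ be the field $\mathbb{Q}(\zeta_1, \ldots, \zeta_j)$, giving a "tower" of extensions,

$$\mathbb{Q} \subset K_1 \subset K_2 \subset \cdots \subset K_n.$$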



We will show the following two things: (a) each of the fields $K_j$ is closed under conjugation, ie: if
$z \in K_j$ then $\bar{z} \in K_j$, and (b) the degree of each extension $K_{j-1} \subset K_j$ is at most two. Part (a) is just a
technical convenience, the main point of which is illustrated by the exercise following the proof. We will
prove it by induction: $K_1 = \mathbb{Q}(i)$ is clearly closed under conjugation, so we assume that $K_i$ is closed
under conjugation if $i < j$.

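Suppose that $\zeta_j$ is obtained as in case (i), ie: as the intersection of two straight lines. The Cartesian
equation for one of the straight lines is $y = m_1x + c_1$, and suppose this line passes through the points
$\zeta_p, \zeta_q$, with $\zeta_p, \zeta_q \in K_{j-1}$. As this field is closed under conjugation, Exercise 87 gives that the real and
imaginary parts of $\zeta_p$ and $\zeta_q$ are in $K_{j-1}$ too. As $\zeta_p, \zeta_q$ lie on the line with this equation we have,

$$\mathrm{Im}\,\zeta_q = m_1\mathrm{Re}\,\zeta_q + c_1, \quad \mathrm{Im}\,\zeta_p = m_1\mathrm{Re}\,\zeta_p + c_1 \quad\Rightarrow\quad m_1 = \frac{\mathrm{Im}\,\zeta_q - \mathrm{Im}\,\zeta_p}{\mathrm{Re}\,\zeta_q - \mathrm{Re}\,\zeta_p} \in K_{j-1} \ \text{and}\ c_1 = \mathrm{Im}\,\zeta_p - m_1\mathrm{Re}\,\zeta_p \in K_{j-1}$$

(unless the line is vertical with equation $x = c_1$, in which case $c_1 = \mathrm{Re}\,\zeta_p \in K_{j-1}$). Similarly if the
equation of the other line is $y = m_2x + c_2$, we have $m_2, c_2 \in K_{j-1}$. As $\zeta_j$ lies on both these lines we
have

$$\mathrm{Im}\,\zeta_j = m_2\mathrm{Re}\,\zeta_j + c_2, \quad \mathrm{Im}\,\zeta_j = m_1\mathrm{Re}\,\zeta_j + c_1, \quad m_i, c_i \in K_{j-1} \quad\Rightarrow\quad \mathrm{Re}\,\zeta_j = \frac{c_2 - c_1}{m_1 - m_2} \ \text{and}\ \mathrm{Im}\,\zeta_j = \frac{m_1(c_2 - c_1)}{m_1 - m_2} + c_1,$$

and so $\mathrm{Re}\,\zeta_j$ and $\mathrm{Im}\,\zeta_j$ are in $K_{j-1}$ as well. As this field contains $i$ we have that
$\zeta_j \in K_{j-1}$ too, so that in fact $K_j = K_{j-1}(\zeta_j) = K_{j-1}$. Thus the degree of the extension $K_{j-1} \subset K_j$,
being one, is certainly $\le 2$. Moreover, $K_j$ is closed under conjugation as $K_{j-1}$ is.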

For case (ii), suppose that the line has equation $y = mx + c$ and the circle equation $(x - \mathrm{Re}\,\zeta_s)^2 +
(y - \mathrm{Im}\,\zeta_s)^2 = r^2$, where $r^2 = (\mathrm{Re}\,\zeta_r - \mathrm{Re}\,\zeta_s)^2 + (\mathrm{Im}\,\zeta_r - \mathrm{Im}\,\zeta_s)^2$, so that in particular $r^2 \in K_{j-1}$
as $\zeta_p, \zeta_q, \zeta_r, \zeta_s \in K_{j-1}$, hence their real and imaginary parts are too. As $\zeta_j$ lies on the line we have
$\mathrm{Im}\,\zeta_j = m\mathrm{Re}\,\zeta_j + c$, and it lies on the circle too, so that,

$$(\mathrm{Re}\,\zeta_j - \mathrm{Re}\,\zeta_s)^2 + (m\mathrm{Re}\,\zeta_j + c - \mathrm{Im}\,\zeta_s)^2 = r^2.$$

Thus the polynomial $(x - \mathrm{Re}\,\zeta_s)^2 + (mx + c - \mathrm{Im}\,\zeta_s)^2 - r^2$ is a quadratic with $K_{j-1}$ coefficients having
$\mathrm{Re}\,\zeta_j$ as a root. As the minimum polynomial over $K_{j-1}$ of $\mathrm{Re}\,\zeta_j$ divides any other $K_{j-1}$-polynomial
having $\mathrm{Re}\,\zeta_j$ as a root, we get that this minimum polynomial has degree $\le 2$. Theorem D then gives,

$$[K_{j-1}(\mathrm{Re}\,\zeta_j) : K_{j-1}] \le 2.$$

In fact, $\mathrm{Im}\,\zeta_j \in K_{j-1}(\mathrm{Re}\,\zeta_j)$ as $\mathrm{Im}\,\zeta_j = m\mathrm{Re}\,\zeta_j + c$, thus $\zeta_j$ itself is in $K_{j-1}(\mathrm{Re}\,\zeta_j)$, as $i$ also is. Hence
we have the sequence,

$$K_{j-1} \subset K_j = K_{j-1}(\zeta_j) \subset K_{j-1}(\mathrm{Re}\,\zeta_j),$$

giving that the degree of the extension $K_{j-1} \subset K_j$ is also $\le 2$ by the tower law. Finally, we show that
the field $K_j$ is closed under conjugation, for which we can assume that the degree is two (it is trivially
the case if the degree is one). Now, $K_j = K_{j-1}(\zeta_j) = K_{j-1}(\mathrm{Re}\,\zeta_j)$, so that in particular $\zeta_j$ and its real
part are in $K_j$, hence its imaginary part

$$\mathrm{Im}\,\zeta_j = \frac{\zeta_j - \mathrm{Re}\,\zeta_j}{i},$$

is too. The upshot is that $\mathrm{Re}\,\zeta_j - i\,\mathrm{Im}\,\zeta_j = \bar{\zeta}_j$ is in $K_j$, and as the elements of this field have the form
$a + b\zeta_j$ with $a, b \in K_{j-1}$, we get that it is indeed closed under conjugation.
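Finally, case (iii). As $\zeta_j$ lies on both circles we have,

$$(\mathrm{Re}\,\zeta_j - \mathrm{Re}\,\zeta_s)^2 + (\mathrm{Im}\,\zeta_j - \mathrm{Im}\,\zeta_s)^2 = r^2 \quad\text{and}\quad (\mathrm{Re}\,\zeta_j - \mathrm{Re}\,\zeta_p)^2 + (\mathrm{Im}\,\zeta_j - \mathrm{Im}\,\zeta_p)^2 = s^2,$$

with both $r^2$ and $s^2$ in $K_{j-1}$ for the same reason as in case (ii). Expanding both expressions, they contain
terms of the form $(\mathrm{Re}\,\zeta_j)^2 + (\mathrm{Im}\,\zeta_j)^2$, and equating leads to,

$$\mathrm{Im}\,\zeta_j = \frac{\beta_1}{\alpha}\mathrm{Re}\,\zeta_j + \frac{\beta_2}{\alpha}, \quad\text{where } \alpha = 2(\mathrm{Im}\,\zeta_s - \mathrm{Im}\,\zeta_p),\ \beta_1 = 2(\mathrm{Re}\,\zeta_p - \mathrm{Re}\,\zeta_s)$$
$$\text{and } \beta_2 = (\mathrm{Re}\,\zeta_s)^2 + (\mathrm{Im}\,\zeta_s)^2 - ((\mathrm{Re}\,\zeta_p)^2 + (\mathrm{Im}\,\zeta_p)^2) + s^2 - r^2.$$

Combining this $K_{j-1}$-expression for $\mathrm{Im}\,\zeta_j$ with the first of the two circle equations above puts us into a
similar situation as part (ii), from which the result follows in the same way.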
($\Leftarrow$) Now for the "if" part. Suppose that we have a tower of fields,

$$\mathbb{Q} = K_0 \subset K_1 \subset K_2 \subset \cdots \subset K_n,$$

with $\mathbb{Q}(\zeta)$ in $K_n$. Each $K_j$ is a simple extension $K_j = K_{j-1}(\zeta_j)$, so $K_j = \mathbb{Q}(\zeta_1, \ldots, \zeta_j)$, and in
particular, $K_n = \mathbb{Q}(\zeta_1, \ldots, \zeta_n)$. We may as well assume that $\mathbb{Q}(\zeta)$ is not contained in $K_{n-1}$, so that
$\zeta \notin K_{n-1}$. As $\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta) \subset \mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta_n)$, we have that $\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta)$ is a degree
two extension of $\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1})$, so by Exercise 75, part 2,

$$\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta) = \mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta_n).$$

Thus, the tower of extensions has the form,

$$\mathbb{Q} \subset \mathbb{Q}(\zeta_1) \subset \cdots \subset \mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}) \subset \mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1}, \zeta).$$

It suffices to prove therefore, that whenever we have an extension $K \subset K(\theta)$ of degree two, then there
are finitely many elements of $K$ from which $\theta$ can be constructed in a finite number of steps. For if so,
then $\zeta$ can be constructed from finitely many elements of $\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-1})$, each of which in turn can be
constructed from finitely many elements of $\mathbb{Q}(\zeta_1, \ldots, \zeta_{n-2})$, and so on.
Given $K \subset K(\theta)$ as above then, the minimum polynomial of $\theta$ over $K$ has the form $x^2 + bx + c$, with
$b, c \in K$, so that $\theta$ is one of,

$$\frac{-b \pm \sqrt{b^2 - 4c}}{2},$$

which can be constructed from $1, 2, 4, b, c \in K$, using the arithmetical and square root constructions of
§7. □

Exercise 87 Let $K$ be a field such that

$$\mathbb{Q}(i) \subset K \subset \mathbb{C},$$

as well as being closed under conjugation, ie: if $z \in K$ then $\bar{z} \in K$. Show that $z \in K$ if and only if the real and imaginary parts
of $z$ are in $K$.

(10.2) It is much easier to use the "only if" part of the Theorem, which shows when numbers cannot be
constructed, so we restate this part as a separate,

Corollary. If $\zeta \in \mathbb{C}$ is constructible then the degree of the extension $\mathbb{Q} \subset \mathbb{Q}(\zeta)$ must be a power of two.

To use the "if" part, in other words, to show that numbers can be constructed by finding a tower of
fields as in Theorem E, is a little harder. We will need to know a great deal more about the fields stacked
in between $\mathbb{Q}$ and $\mathbb{Q}(\zeta)$ before we can do this. The Galois Correspondence in §14. will give us the control
to do this, so we postpone any attempts at using the "if" part of Theorem E until then.

Proof: If $\zeta$ is constructible then we have the tower of degree two extensions as given in Theorem E, with
$\zeta \in K_n$. Thus we have the sequence of extensions $\mathbb{Q} \subset \mathbb{Q}(\zeta) \subset K_n$, which by the tower law gives,

$$[K_n : \mathbb{Q}] = [K_n : \mathbb{Q}(\zeta)][\mathbb{Q}(\zeta) : \mathbb{Q}].$$
Thus $[\mathbb{Q}(\zeta) : \mathbb{Q}]$ divides $[K_n : \mathbb{Q}]$, which is a power of two, so $[\mathbb{Q}(\zeta) : \mathbb{Q}]$ must also be a power of two. □

(10.3) Notice that the Corollary is only stated in one direction. Indeed, the converse, that if the extension 
has degree a power of two, then the number is constructible, is not true. 

(10.4) A regular $p$-gon, for $p$ a prime, can be constructed, by Exercise ??, precisely when the complex
number $\zeta = \cos(2\pi/p) + i\sin(2\pi/p)$ can be constructed, so we need to find the degree of the extension
$\mathbb{Q} \subset \mathbb{Q}(\zeta)$. By Exercise 29, the minimum polynomial of $\zeta$ over $\mathbb{Q}$ is the $p$-th cyclotomic polynomial,

$$\Phi_p(x) = x^{p-1} + x^{p-2} + \cdots + x + 1.$$

Thus the degree of the extension $\mathbb{Q} \subset \mathbb{Q}(\zeta)$ is $p - 1$, hence by the Corollary to Theorem E we require,
for the $p$-gon to be constructible, that $p - 1$ is a power of two. In other words, the prime $p$ is of the form

$$p = 2^n + 1.$$

Actually, even more can be said. If $m$ is odd, the polynomial $x^m + 1$ has $-1$ as a root, thus can be
factorised as $x^m + 1 = (x + 1)(x^{m-1} - x^{m-2} + x^{m-3} - \cdots - x + 1)$. Thus if $n = mk$ for $m > 1$ odd, we
have

$$2^n + 1 = (2^k)^m + 1 = (2^k + 1)((2^k)^{m-1} - (2^k)^{m-2} + (2^k)^{m-3} - \cdots - (2^k) + 1),$$

so that $2^n + 1$ cannot be prime unless $n$ has no odd divisors other than 1, which means that $n$ itself must be a power
of two.

Thus for a $p$-gon to be constructible, we must have that $p$ is a prime number of the form

$$2^{2^t} + 1,$$

a so-called Fermat prime. Such primes are extremely rare: the only ones $< 10^{900}$ are

3, 5, 17, 257 and 65537.

We will see in §15. that the converse is true: if $p$ is a Fermat prime, then a regular $p$-gon can be constructed!
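
A short computation makes the rarity concrete for small exponents. This sketch (with plain trial division and helper names chosen for the example) tests $2^k + 1$ for $1 \le k \le 16$ and records which exponents give a prime.

```python
def is_prime(n):
    """Trial division; adequate for the small range used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def is_power_of_two(n):
    return n >= 1 and n & (n - 1) == 0

# Primes of the form 2^k + 1 with 1 <= k <= 16.
candidates = [2 ** k + 1 for k in range(1, 17)]
fermat_like = [p for p in candidates if is_prime(p)]
print(fermat_like)

# The exponents k that survive are themselves powers of two.
exponents = [k for k in range(1, 17) if is_prime(2 ** k + 1)]
print(exponents, all(is_power_of_two(k) for k in exponents))
```

The surviving primes are exactly 3, 5, 17, 257 and 65537, with exponents 1, 2, 4, 8 and 16, in line with the argument that $n$ must be a power of two.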
(10.5) A square plot of land can always be doubled in area using a ruler and compass:

[Figure: the original square of side $t$, with corner $(0, t)$, and the point $(\sqrt{2}t, \sqrt{2}t)$.]

Whatever the side length $t$ of the original square is, it is constructible: just set the compass to the side length. As $\sqrt{2}$ is also a constructible number, we can construct the point with coordinates $(\sqrt{2}t, \sqrt{2}t)$, hence doubling the area.




What about a regular cube: is there a similar procedure? Suppose the original cube has side length 1, so that the task is to produce a new cube of volume 2. If this could be accomplished via a ruler and compass construction, then by setting the compass to the side length of the new cube, we would have constructed $\sqrt[3]{2}$. But the minimum polynomial over $\mathbb{Q}$ of $\sqrt[3]{2}$ is clearly $x^3 - 2$, with the extension $\mathbb{Q} \subset \mathbb{Q}(\sqrt[3]{2})$ thus having degree three, which is not a power of two. Thus, such a construction cannot be possible.



(10.6) The subset of $\mathbb{R}^n$ given by

$$\square^n = \{x \in \mathbb{R}^n \mid |x_i| \leq \tfrac{t}{2} \text{ for all } i\}$$

is an $n$-dimensional cube of side length $t$ having volume $t^n$. In particular, in 4 dimensions we have the so-called 8-cell or hypercube, which can always be doubled in volume because the point with coordinates $(\sqrt[4]{2}t, \sqrt[4]{2}t, \sqrt[4]{2}t, \sqrt[4]{2}t)$ can be constructed!

[Figure: the 8-cell or hypercube, the vertices of which can be placed on the 3-sphere $S^3$ in $\mathbb{R}^4$. Stereographically projecting $S^3$ to $\mathbb{R}^3$ gives the picture. It is the shadow cast by a hypercube on a 3-dimensional table top sitting in the 4-dimensional sun.]

(10.7) One of our basic ruler and compass constructions was to bisect an angle. It is therefore natural to ask if there is a construction that trisects an arbitrary angle. Certainly there are particular angles that can be trisected, for instance, if the angle $\varphi$ is constructible then the angle $3\varphi$ can be trisected!

However, the angle $\pi/3$ cannot be trisected, as we show by demonstrating that the angle $\pi/9$ cannot be constructed.

Exercise 88 Evaluate the complex number $(\cos\varphi + i\sin\varphi)^3$ in two different ways: using the binomial theorem and De Moivre's theorem. By equating real parts, deduce that

$$\cos 3\varphi = 4\cos^3\varphi - 3\cos\varphi.$$

Derive a similar expression for $\cos 5\varphi$ and $\cos 7\varphi$. What about the general case?
We have from Exercise ?? that the angle $\pi/9$ is constructible precisely when the number $\cos(\pi/9)$ can be constructed, for which it is necessary in turn that the degree of the extension $\mathbb{Q} \subset \mathbb{Q}(\cos(\pi/9))$ be a power of two. Using Exercise 88 with $\varphi = \pi/9$ we get that,

$$\cos\frac{\pi}{3} = 4\cos^3\frac{\pi}{9} - 3\cos\frac{\pi}{9} \Leftrightarrow 1 = 8\cos^3\frac{\pi}{9} - 6\cos\frac{\pi}{9}.$$

Thus, if $u = 2\cos(\pi/9)$, then $u^3 - 3u - 1 = 0$. This polynomial is irreducible over $\mathbb{Q}$ by the reduction test applied with $p = 2$, so it is the minimum polynomial over $\mathbb{Q}$ of $2\cos(\pi/9)$. Thus, the extension $\mathbb{Q} \subseteq \mathbb{Q}(2\cos(\pi/9)) = \mathbb{Q}(\cos(\pi/9))$ has degree three, and the angle $\pi/9$ cannot be constructed.
We will be able to say more about which angles of the form $\pi/n$ can be constructed in §15.
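Both numerical facts used in (10.7) are easy to sanity-check. The sketch below, assuming floating-point arithmetic is accurate enough for the first check, confirms that $u = 2\cos(\pi/9)$ satisfies $u^3 - 3u - 1 = 0$ and that the cubic has no root modulo 2; a cubic with no root modulo 2 has no linear factor there, which is what the reduction test with $p = 2$ relies on. The names `poly`, `residual` and `roots_mod_2` are illustrative only.

```python
import math


def poly(x):
    """The cubic x^3 - 3x - 1 from (10.7)."""
    return x ** 3 - 3 * x - 1


# Numerical check that u = 2cos(pi/9) is a root (up to rounding error).
u = 2 * math.cos(math.pi / 9)
residual = poly(u)

# Reduction modulo 2: look for roots among the residues 0 and 1.
roots_mod_2 = [x for x in range(2) if poly(x) % 2 == 0]

print(abs(residual) < 1e-12)
print(roots_mod_2)
```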

Exercise 89 

1 . Can you construct an angle of 40° ? 

2. Assuming 72° is constructible, what about 24° and 8°? 

Further Exercises for §10. 
Exercise 90 The octahedron and icosahedron are two of the five Platonic solids. The volume of each is given by the formula,

$$V_O = \frac{x^3\sqrt{2}}{3}, \qquad V_I = \frac{5x^3(3 + \sqrt{5})}{12},$$

where $x$ is the length of any edge. Show that in each case, there is no general method, using a ruler and compass, to construct a new solid from a given one, and having twice the volume.



Exercise 91 Consider a regular dodecahedron with volume as given, where $x$ is the length of any edge:

$$V_D = \frac{x^3(15 + 7\sqrt{5})}{4}.$$

Show that there is no general method, using a ruler and compass, to construct a new dodecahedron from a given one, and having five times the volume.

Exercise 92 Let $S_O$, $S_D$ and $S_I$ be the surface areas of the three Platonic solids of Exercises 90 and 91. If,

$$S_O = 2x^2\sqrt{3}, \quad S_D = 3x^2\sqrt{5(5 + 2\sqrt{5})} \quad \text{and} \quad S_I = 5x^2\sqrt{3},$$

determine whether or not a solid can be constructed from a given one with twice the surface area.
Exercise 93 Using the identity

$$\cos 5\theta = 16\cos^5\theta - 20\cos^3\theta + 5\cos\theta,$$

show that it is impossible, using a ruler and compass, to quinsect (that is, divide into 5 equal parts) any angle $\psi$ that satisfies,

$$\cos\psi = \frac{5}{6}.$$

Exercise 94 Using the identity,

$$\cos 7\theta = 64\cos^7\theta - 112\cos^5\theta + 56\cos^3\theta - 7\cos\theta,$$

show that it is impossible, using ruler and compass, to septsect (that is, divide into seven equal parts) any angle $\psi$ such that

$$\cos\psi = \frac{7}{8}.$$

Exercise 95 Show that if a general angle can be n-sected (that is, divided into n equal parts) then a regular n-gon can be con- 
structed. Use this to re-deduce the result of the last exercise and to obtain conditions on a prime p, such that a general angle can be 
divided into p equal parts. 
§11. Groups I: A Miscellany

As the title suggests, this section collects together some (carefully chosen) random facts about groups. 

(11.1) A permutation of a set $X$ is a bijection $X \to X$. Mostly we are interested in the case where $X$ is finite, say $X = \{1, 2, \ldots, n\}$, so that a permutation is just a rearrangement of these numbers. Permutations are most compactly written using the cycle notation

$$(a_{11}, a_{12}, \ldots, a_{1n_1})(a_{21}, a_{22}, \ldots, a_{2n_2}) \cdots (a_{k1}, a_{k2}, \ldots, a_{kn_k})$$

where the $a_{ij}$ are elements of $\{1, 2, \ldots, n\}$. Each $(a_{i1}, a_{i2}, \ldots, a_{in_i})$ indicates that the points are permuted in a cycle as

$$a_{i1} \mapsto a_{i2} \mapsto \cdots \mapsto a_{in_i} \mapsto a_{i1}.$$

The cumulative effect of the cycles is obtained by dealing with them from right to left, eg: $(1, 2)(1, 2, 4, 3)(1, 3)(2, 4) = (1, 2, 3)$. A permutation can always be rewritten so that the points in the cycles are all distinct.
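The right-to-left convention is easy to get wrong by hand, so here is a minimal sketch that multiplies cycles mechanically and rewrites the answer in disjoint cycles. The function names (`cycle_to_map`, `product`, `to_cycles`) are illustrative choices, and permutations are stored as dictionaries on $\{1, \ldots, n\}$.

```python
def cycle_to_map(cycle, n):
    """Turn one cycle such as (1, 2, 4, 3) into a dict on {1, ..., n}."""
    mapping = {x: x for x in range(1, n + 1)}
    for pos, x in enumerate(cycle):
        mapping[x] = cycle[(pos + 1) % len(cycle)]
    return mapping


def product(cycles, n):
    """Compose a list of cycles, applying the rightmost cycle first."""
    result = {x: x for x in range(1, n + 1)}
    for cycle in reversed(cycles):
        step = cycle_to_map(cycle, n)
        result = {x: step[result[x]] for x in result}
    return result


def to_cycles(mapping):
    """Write a permutation as disjoint cycles, omitting fixed points."""
    seen, cycles = set(), []
    for start in sorted(mapping):
        if start in seen or mapping[start] == start:
            continue
        cycle, x = [], start
        while x not in seen:
            seen.add(x)
            cycle.append(x)
            x = mapping[x]
        cycles.append(tuple(cycle))
    return cycles


# The example from (11.1): (1,2)(1,2,4,3)(1,3)(2,4).
example = to_cycles(product([(1, 2), (1, 2, 4, 3), (1, 3), (2, 4)], 4))
print(example)
```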

The set of all permutations of the set $X$ forms a group under composition of bijections called the symmetric group $S_X$, or $S_n$ if $X = \{1, 2, \ldots, n\}$.

(11.2) Any permutation can be written as a composition of permutations where just two things are 
swapped, and everything else is left fixed. In other words, any permutation can be written as a com- 
position of transpositions of the form (a, b): 

$$(a_1, a_2, \ldots, a_l) = (a_1, a_l)(a_1, a_{l-1}) \cdots (a_1, a_3)(a_1, a_2).$$

Indeed, much more is true: there may be several ways that a permutation can be decomposed into trans- 
positions like this, and different ways may not involve the same number of transpositions, but any two 
such decompositions will either both involve an even number of transpositions or both an odd number. 

We can thus, without any ambiguity, call a permutation even if it can be decomposed into an even number of transpositions, and odd otherwise. The Alternating group $A_n$ consists of all those elements of $S_n$ that are even.
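As an illustration, the even permutations can be listed by computer for small $n$. The sketch below assumes the standard fact (not proved in these notes) that a permutation is even exactly when its number of inversions is even; the helper names `is_even` and `alternating_group` are illustrative.

```python
from itertools import permutations
from math import factorial


def is_even(perm):
    """perm lists the images of 1..n; parity is read off the inversion count."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return inversions % 2 == 0


def alternating_group(n):
    """All even permutations of {1, ..., n}, as tuples of images."""
    return [p for p in permutations(range(1, n + 1)) if is_even(p)]


sizes = {n: len(alternating_group(n)) for n in range(2, 6)}
print(sizes)
print(all(2 * sizes[n] == factorial(n) for n in sizes))
```

In each case the even permutations make up exactly half of $S_n$, in line with Exercise 96.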

Exercise 96 Show that $A_n$ is indeed a group comprising exactly half of the elements of $S_n$. Show that the odd elements in $S_n$ do not form a group.

Exercise 97 Recall that the order of an element $g$ of a group $G$ is the least $n \geq 1$ such that $g^n = 1$. Show that if $G$ is Abelian then $(gh)^n = g^n h^n$ for every $g, h \in G$. Deduce that the order of $gh$ then divides the lowest common multiple of the orders of $g$ and $h$.

(11.3) If $G$ is a group and $\{g_1, g_2, \ldots, g_n\}$ are elements of $G$, then we say that they generate $G$ when every element $g \in G$ can be obtained as a product

$$g = g_{i_1}^{\pm 1} g_{i_2}^{\pm 1} \cdots g_{i_k}^{\pm 1}$$

of the $g_i$ and their inverses. Write $G = \langle g_1, g_2, \ldots, g_n \rangle$.

Symbol            Name          Notes
$\mathbb{Z}_p$    cyclic        $p$ is a prime
$A_n$             alternating   $n \neq 1, 2, 4$

Table 1: The simplest two families of simple groups

(11.4) We find generators for the symmetric and alternating groups. Firstly, we have already seen that the transpositions $(a, b)$ generate $S_n$, for any permutation can be written as a product

$$(a_1, a_2, \ldots, a_l) = (a_1, a_l)(a_1, a_{l-1}) \cdots (a_1, a_3)(a_1, a_2)$$

of transpositions. The transpositions themselves can be expressed in terms of just some of them: letting $(i, j)$ be our transposition now with $i < j$, we have

$$(i, j) = (i, i+1)(i+1, i+2) \cdots (j-2, j-1)(j-1, j)(j-2, j-1) \cdots (i+1, i+2)(i, i+1)$$

where the easiest way to see that this works is to consider the picture, 

and perform the swaps indicated in the picture in the following order: do the swaps across the top first, 
from left to right, and then the swaps along the bottom from right to left. Any number strictly in between i 
and j moves one place to the right and then one place to the left, with net effect that it remains stationary. 
The point i is moved progressively along to j by the top swaps, but then stays there. Similarly j stays put 
for a while but is then moved progressively rightwards by the bottom swaps. 

Substituting this new expression for each transposition gives any permutation in $S_n$ as a product of transpositions of the form $(i, i+1)$. But in fact even these transpositions can be further reduced, by transferring $i$ and $i+1$ to the two points 1 and 2, performing the swap between these two and transferring the answer back to $i$ and $i+1$. Indeed if $\tau = (1, 2, \ldots, n)$ then the picture,

[Figure: the points permuted by $\tau$, with $i+1$ labelled.]

gives $(i, i+1) = \tau^{i-1}(1, 2)\tau^{1-i}$ as $\tau^{i-1}$ sends 1 to $i$, 2 to $i+1$ and so on, while $\tau^{1-i}$ is its inverse.
The conclusion is that $S_n$ is generated by just two permutations, namely $(1, 2)$ and $(1, 2, \ldots, n)$.
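The conclusion of (11.4) can be tested directly for a small case. The following sketch takes $n = 5$ (an arbitrary choice), stores a permutation as the tuple of images of $1, \ldots, n$, and composes from right to left as in (11.1). It checks the conjugation formula for $(i, i+1)$ and then closes up the two generators under composition; all function names are illustrative.

```python
def compose(p, q):
    """Product pq in the right-to-left convention: apply q first, then p."""
    return tuple(p[q[x] - 1] for x in range(len(p)))


def power(p, k):
    """k-fold product of p with itself, for k >= 0."""
    result = tuple(range(1, len(p) + 1))
    for _ in range(k):
        result = compose(result, p)
    return result


def transposition(a, b, n):
    """The transposition (a, b) as a tuple of images."""
    images = list(range(1, n + 1))
    images[a - 1], images[b - 1] = b, a
    return tuple(images)


def generated(gens):
    """Closure of the generators under composition (enough in a finite group)."""
    identity = tuple(range(1, len(gens[0]) + 1))
    seen, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen


n = 5
swap = transposition(1, 2, n)   # the transposition (1, 2)
tau = (2, 3, 4, 5, 1)           # the n-cycle (1, 2, ..., n)

# tau^(1-i) is computed as tau^(n-(i-1)), since tau has order n.
conjugation_ok = all(
    compose(compose(power(tau, i - 1), swap), power(tau, n - (i - 1)))
    == transposition(i, i + 1, n)
    for i in range(1, n)
)
group_order = len(generated([swap, tau]))
print(conjugation_ok, group_order)
```

The closure has $5! = 120$ elements, so the two permutations do generate all of $S_5$.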

Exercise 98 Show that the Alternating group is generated by the permutations of the form (a, b, c). Show that in fact just the 
3-cycles of the form (1, 2, i) will suffice. 



(11.5) Lagrange's theorem tells us that if $G$ is a finite group and $H$ a subgroup of $G$, then the order $|H|$ of $H$ divides the order $|G|$ of $G$. The converse, that if $k$ divides the order of $G$ then $G$ has a subgroup of order $k$, is false.

Exercise 99 By considering the Alternating group A4, justify this statement. 

Exercise 100 Show that if G is a cyclic group, then the converse to Lagrange's theorem is true, ie: if G has order n and k divides 
n then G has a subgroup of order k. 

Exercise 101 Use Lagrange's Theorem to show that if a group G has order a prime number p, then G is isomorphic to a cyclic 
group. Thus, any two groups of order p are isomorphic. 

There is however a partial converse to Lagrange's Theorem, due to the Norwegian Peter Sylow$^8$:

Sylow's First Theorem. Suppose $G$ is a finite group of order $p^k m$, where $p$ is a prime that does not divide $m$ (ie: $p^k$ is the largest power of $p$ dividing the order of $G$). Then $G$ has a subgroup of order $p^i$ for any $1 \leq i \leq k$.

8 pronounced Soo-lov, not Si-low. 
(11.6) It is often useful to consider all the subgroups of a group at once, rather than just one at a time. The information is summarised in the subgroup lattice, which is a diagram depicting all the subgroups and the relations between them. Specifically, if $H_1, H_2$ are subgroups of $G$ with $H_1 \subset H_2$, place $H_2$ higher in the diagram than $H_1$ with a line connecting them like so,

    H_2
     |
    H_1

At the very base of the diagram is the trivial subgroup $\{id\}$ and at the apex is the other trivial subgroup, namely $G$ itself. For example, the subgroups of the integers $\mathbb{Z}$ all have the form $n\mathbb{Z}$ for some $n$ (ie: the multiples of $n$) and arrange themselves into the lattice:

[Figure: subgroup lattice of $\mathbb{Z}$, with $\mathbb{Z}$ at the top; below it $2\mathbb{Z}, 3\mathbb{Z}, 5\mathbb{Z}, 7\mathbb{Z}, 11\mathbb{Z}, \ldots$; then $4\mathbb{Z}, 6\mathbb{Z}, 9\mathbb{Z}, 10\mathbb{Z}, \ldots$; then $8\mathbb{Z}, 27\mathbb{Z}, \ldots$]

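As another example, the group of symmetries of a square consists of the eight elements,

$$\{id, \sigma, \sigma^2, \sigma^3, \tau, \sigma\tau, \sigma^2\tau, \sigma^3\tau\},$$

where $\sigma$ is a rotation anticlockwise through $\frac{1}{4}$ of a turn and $\tau$ is a reflection in the horizontal axis. The subgroup lattice looks like,

[Figure: subgroup lattice with $G$ at the top, the order-two subgroups $\{id, \sigma^2\tau\}$, $\{id, \tau\}$, $\{id, \sigma^2\}$, $\{id, \sigma^3\tau\}$, $\{id, \sigma\tau\}$ in the row above the base, and $\{id\}$ at the base.]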



(11.7) Suppose we have a finite group $G$ and a sequence of subgroups $H_0 = \{1\}, H_1, \ldots, H_{n-1}, H_n = G$ arranged as follows:

$$\{1\} = H_0 \lhd H_1 \lhd \cdots \lhd H_{n-1} \lhd H_n = G,$$

ie: $H_0$ is a normal subgroup of $H_1$, which is in turn a normal subgroup of $H_2$, and so on. In fact, we can always ensure this if the group is finite: find a normal subgroup of $G$, then a normal subgroup of that normal subgroup, and so on. Eventually the process must stop with the identity subgroup.

Whenever we have normal subgroups we get new groups by taking the quotient. Given a sequence like the above then, we get a sequence of quotient groups,

$$H_1/H_0,\ H_2/H_1,\ \ldots,\ H_n/H_{n-1}.$$

In principle these quotient groups could be anything. In the quite special situation that they all turn out to 
be Abelian, call the group G soluble. 
Symbol                                       Name                            Discovered

$PSL_n\mathbb{F}_q$                          projective                      1870
$PSp_{2n}\mathbb{F}_q$                       symplectic                      1870
$P\Omega^+_{2n}$                             orthogonal                      1870
$P\Omega_{2n+1}$                             orthogonal                      1870
$E_6(q)$                                     Chevalley                       1955
$E_7(q)$                                     Chevalley                       1955
$E_8(q)$                                     Chevalley                       1955
$F_4(q)$                                     Chevalley                       1955
$G_2(q)$                                     Chevalley                       1955
${}^2A_n(q^2) = PSU_n\mathbb{F}_{q^2}$       unitary or twisted Chevalley    1870
${}^2D_n(q^2) = P\Omega^-_{2n}$              orthogonal or twisted Chevalley 1870
${}^2E_6(q^2)$                               twisted Chevalley               c. 1960
${}^3D_4(q^3)$                               twisted Chevalley               c. 1960
${}^2B_2(2^{2e+1})$                          Suzuki                          1960
${}^2G_2(3^{2e+1})$                          Ree                             1961
${}^2F_4(2^{2e+1})$                          Ree                             1961

notes: $n$ and $e$ are $\in \mathbb{Z}$; $q$ is a prime power. There are some restrictions on $n$ and $q$, left off here for clarity.

Table 2: The simple groups of Lie type



(11.8) If G is an Abelian group, then consider the sequence of subgroups, 

$$\{1\} \lhd G,$$

(note that the trivial subgroup is always a normal subgroup). There is only one quotient group to consider 
here, namely G/{1} = G, an Abelian group. Thus Abelian groups themselves are soluble, and indeed, 
one can think of solubility as a generalisation of Abelian. 

(11.9) For another example, take the symmetries, both rotations and reflections, of a regular $n$-gon in the plane, and the sequence,

$$\{1\} \lhd \{\text{rotations}\} \lhd \{\text{rotations and reflections}\}$$

To convince ourselves first of all that this is indeed a proper sequence, we need that the rotations form a normal subgroup of the full group of symmetries. That they form a subgroup is not hard to see, and the normality follows from the fact that the rotations comprise half of all the symmetries and Exercise 110.

Moreover, the rotations are isomorphic as a group to the cyclic group $\mathbb{Z}_n$, and so the quotients of this sequence are

$$\{\text{rotations}\}/\{1\} = \{\text{rotations}\} \cong \mathbb{Z}_n \text{ and } \{\text{rotations and reflections}\}/\{\text{rotations}\} \cong \mathbb{Z}_2,$$

both Abelian groups. Thus the dihedral groups are soluble$^9$.

(11.10) It turns out, although for quite technical reasons (see the next couple of exercises) that a subgroup 
of a soluble group is also soluble. 

Exercise 102 Let $H$ be a subgroup and $N$ a normal subgroup of some group $G$ and,

$$NH = \{nh \mid n \in N, h \in H\}.$$

1. Define a map $\varphi : H \to NH/N$ by $\varphi(h) = Nh$. Show that $\varphi$ is an onto homomorphism with kernel $N \cap H$.

2. Use the first isomorphism theorem for groups to deduce that $H/(H \cap N)$ is isomorphic to $NH/N$.

(This is called the second isomorphism or diamond isomorphism theorem. Why diamond? Draw a picture of all the subgroups: the theorem says that two "sides" of a diamond are isomorphic.)

9 Groups like this, where you have a 2-step sequence $\{1\} \lhd H \lhd G$ with Abelian quotients, are sometimes called meta-Abelian.

Exercise 103 Let $G$ be a soluble group with series,

$$\{1\} = H_0 \lhd H_1 \lhd \cdots \lhd H_{n-1} \lhd H_n = G,$$

and $K$ a subgroup of $G$. Intersect $K$ with all the $H_i$ and use the second isomorphism theorem to show that

$$\{1\} = H_0 \cap K \lhd H_1 \cap K \lhd \cdots \lhd H_{n-1} \cap K \lhd H_n \cap K = K,$$

is a series with Abelian quotients for $K$, hence $K$ is soluble too.

(11.11) In some sense the antithesis of the soluble groups are the simple groups: groups G whose only 
normal subgroups are the trivial subgroup {1} and the whole group G. These two are always normal 
subgroups, so one could say that a group is simple when it has no non-trivial normal subgroups. 

Whenever we have normal subgroups we can take quotients, so another way of putting it is to say that a group is simple whenever its only quotients are itself $G/\{1\} \cong G$ and the trivial group $G/G \cong \{1\}$. In this way simple groups are analogous to prime numbers, which are integers whose only quotients$^{10}$ are themselves $p/1 = p$ and the trivial integer $p/p = 1$.

The reason that simple groups are at the opposite end of a spectrum to soluble ones is this: if $G$ is non-Abelian and simple, then $G$ cannot be soluble. For, the only sequence of normal subgroups that $G$ can have is

$$\{1\} \lhd G,$$

and as $G$ is non-Abelian the only quotient of this sequence, $G/\{1\} \cong G$, is non-Abelian. Thus, non-Abelian simple groups provide a ready source of non-soluble groups.

(11.12) So what are these groups then? Amazingly, there is a complete list, compiled over approximately 150 years, through the efforts of over 100 mathematicians, and running to roughly 15000 pages of research articles. It is quite possibly the greatest taxonomic (if not necessarily conceptual) achievement of 20th Century Mathematics. The list is contained in Tables 1-3.

Exercise 104 Show that if p is a prime number then the cyclic group Z p has no non-trivial subgroups whatsoever, and so is 
certainly a simple group. 

(11.13) Looking at Table 1 we see that the Alternating groups $A_n$ are simple for $n \neq 1, 2$ or 4. Thus these Alternating groups are not soluble, and as any subgroup of a soluble group is soluble, any group containing the Alternating group will also not be soluble. Thus, the symmetric groups $S_n$ are not soluble if $n \neq 1, 2$ or 4.

Exercise 105 Show that the previous statement is not quite correct in that the symmetric group S3 is soluble. 

(11.14) Tables 2 and 3 list the really interesting simple groups. The groups of Lie type are basically groups of matrices whose entries come from finite fields. We have already seen that if $q = p^n$ is a prime power, then there is a field $\mathbb{F}_q$ with $q = p^n$ elements. The group $SL_n\mathbb{F}_q$ consists of the $n \times n$ matrices of determinant 1 with entries from this field and the usual matrix multiplication. Unfortunately this group is not simple as the subset

$$N = \{\lambda I_n \mid \lambda \in \mathbb{F}_q,\ \lambda^n = 1\},$$

consisting of all scalar multiples of the identity matrix that lie in the group, forms a normal subgroup. But it turns out that this is the biggest normal subgroup you can find in the sense that the quotient group,

$$SL_n\mathbb{F}_q / N,$$

10 Obviously the way it is normally put is to say that the only divisors are itself and one, but as the notion of divisor does not carry over quite so easily to group theory, we use quotients instead.
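Symbol      Name                                               Discovered   Order

1. First generation of the Happy Family
$M_{11}$    Mathieu                                            1861         $2^4 \cdot 3^2 \cdot 5 \cdot 11$
$M_{12}$    Mathieu                                            1861         $2^6 \cdot 3^3 \cdot 5 \cdot 11$
$M_{22}$    Mathieu                                            1873         $2^7 \cdot 3^2 \cdot 5 \cdot 7 \cdot 11$
$M_{23}$    Mathieu                                            1873         $2^7 \cdot 3^2 \cdot 5 \cdot 7 \cdot 11 \cdot 23$
$M_{24}$    Mathieu                                            1873         $2^{10} \cdot 3^3 \cdot 5 \cdot 7 \cdot 11 \cdot 23$

2. Second generation of the Happy Family
$HJ$        Hall-Janko                                         1968         $2^7 \cdot 3^3 \cdot 5^2 \cdot 7$
$HiS$       Higman-Sims                                        1968         $2^9 \cdot 3^2 \cdot 5^3 \cdot 7 \cdot 11$
$McL$       McLaughlin                                         1969         $2^7 \cdot 3^6 \cdot 5^3 \cdot 7 \cdot 11$
$Suz$       Suzuki                                             1969         $2^{13} \cdot 3^7 \cdot 5^2 \cdot 7 \cdot 11 \cdot 13$
$Co_1$      Conway                                             1969         $2^{21} \cdot 3^9 \cdot 5^4 \cdot 7^2 \cdot 11 \cdot 13 \cdot 23$
$Co_2$      Conway                                             1969?        $2^{18} \cdot 3^6 \cdot 5^3 \cdot 7 \cdot 11 \cdot 23$
$Co_3$      Conway                                             1969?        $2^{10} \cdot 3^7 \cdot 5^3 \cdot 7 \cdot 11 \cdot 23$

3. Third generation of the Happy Family
$He$        Held                                               1968         $2^{10} \cdot 3^3 \cdot 5^2 \cdot 7^3 \cdot 17$
$Fi_{22}$   Fischer                                            1968         $2^{17} \cdot 3^9 \cdot 5^2 \cdot 7 \cdot 11 \cdot 13$
$Fi_{23}$   Fischer                                            1968         $2^{18} \cdot 3^{13} \cdot 5^2 \cdot 7 \cdot 11 \cdot 13 \cdot 17 \cdot 23$
$Fi_{24}$   Fischer                                            1968         $2^{21} \cdot 3^{16} \cdot 5^2 \cdot 7^3 \cdot 11 \cdot 13 \cdot 17 \cdot 23 \cdot 29$
$F_5$       Harada-Norton                                      1973         $2^{14} \cdot 3^6 \cdot 5^6 \cdot 7 \cdot 11 \cdot 19$
$F_3$       Thompson                                           1973         $2^{15} \cdot 3^{10} \cdot 5^3 \cdot 7^2 \cdot 13 \cdot 19 \cdot 31$
$F_2$       Fischer or "Baby Monster"                          1973         $2^{41} \cdot 3^{13} \cdot 5^6 \cdot 7^2 \cdot 11 \cdot 13 \cdot 17 \cdot 19 \cdot 23 \cdot 31 \cdot 47$
$M$         Fischer-Griess or "Friendly Giant" or "Monster"    1973         $\approx 8 \times 10^{53}$

4. The Pariahs
$J_1$       Janko                                              1965         $2^3 \cdot 3 \cdot 5 \cdot 7 \cdot 11 \cdot 19$
$J_3$       Janko                                              1968         $2^7 \cdot 3^5 \cdot 5 \cdot 17 \cdot 19$
$J_4$       Janko                                              1975         $2^{21} \cdot 3^3 \cdot 5 \cdot 7 \cdot 11^3 \cdot 23 \cdot 29 \cdot 31 \cdot 37 \cdot 43$
$Ly$        Lyons                                              1969         $2^8 \cdot 3^7 \cdot 5^6 \cdot 7 \cdot 11 \cdot 31 \cdot 37 \cdot 67$
$Ru$        Rudvalis                                           1972         $2^{14} \cdot 3^3 \cdot 5^3 \cdot 7 \cdot 13 \cdot 29$
$O'N$       O'Nan                                              1973         $2^9 \cdot 3^4 \cdot 5 \cdot 7^3 \cdot 11 \cdot 19 \cdot 31$

Table 3: The sporadic simple groups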
has no non-trivial normal subgroups, ie: is a simple group. It is denoted $PSL_n\mathbb{F}_q$, and called the $n$-dimensional projective$^{11}$ special linear group over $\mathbb{F}_q$. The remaining groups in Table 2 come from more complicated constructions.

Table 3 lists groups that don't seem to fall into any of the categories described so far; for this reason they are called the "sporadic" simple groups. They arise from various (often quite complicated) constructions that are well beyond the remit of these notes. The most interesting of them is the largest one, the Monster simple group (which actually contains quite a few of the others as subgroups). The Monster has a number
of fascinating connections with a diverse range of mathematical areas, including number theory (where it 
plays a central role in something called "Monstrous Moonshine") and even Mathematical Physics. 

All of this notwithstanding, the simple groups in Tables 2 and 3 are all non-Abelian, hence provide 
ready examples of non-soluble groups. 

Further Exercises for §11. 
Exercise 106 Show that any subgroup of an abelian group is normal. 

Exercise 107 Let $G$ be a group. Show that $G/G \cong \{id\}$ and $G/\{id\} \cong G$.

Exercise 108 Let $n$ be a positive integer that is not prime (sometimes called a composite integer). Show that the cyclic group $\mathbb{Z}_n$ is not simple.

Exercise 109 Show that $A_2$ and $A_4$ are not simple groups, but $A_3$ is.

Exercise 110 Let $G$ be a group and $H$ a subgroup such that $H$ has exactly two cosets in $G$. Let $C_2$ be the group of order two with elements $\{-1, 1\}$ and operation just usual multiplication. Define a map $f : G \to C_2$ by

$$f(g) = \begin{cases} 1 & g \in H \\ -1 & g \notin H \end{cases}$$

1. Show that $f$ is a homomorphism.

2. Deduce that $H$ is a normal subgroup.

Exercise 111 Consider the group of symmetries (rotations and reflections) of a regular $n$-sided polygon for $n \geq 3$. Show that this is not a simple group.

Exercise 112 Show that $S_2$ is simple but $S_n$ isn't for $n \geq 3$. Show that $A_n$ has no subgroups of index 2 for $n \geq 5$.

Exercise 113 Show that if $G$ is abelian and simple then it is cyclic. Deduce that if $G$ is simple and not $\mathbb{Z}_p$ then $G$ is non-Abelian.

Exercise 114 For each of the following groups $G$, draw the subgroup lattice $\mathcal{L}_G$:

1. $G$ = the group of symmetries of a square, pentagon or hexagon.

2. $G$ = the cyclic group $\{1, \sigma, \sigma^2, \ldots, \sigma^{n-1}\}$ where $\sigma^i\sigma^j = \sigma^{i+j \bmod n}$ and $\sigma^n = 1$.

11 The name "projective" comes from the fact that the group is the group of symmetries of projective geometry over the field $\mathbb{F}_q$.
§12. Groups II: Symmetries of Fields

We are finally in a position to introduce the idea of symmetry into the solutions of polynomial equations. 

(12.1) An automorphism of a field $F$ is an isomorphism $\sigma : F \to F$, ie: a bijective map from $F$ to $F$ such that $\sigma(a + b) = \sigma(a) + \sigma(b)$ and $\sigma(ab) = \sigma(a)\sigma(b)$ for all $a, b \in F$.

We commented in §4. that an isomorphism of fields (indeed of any algebraic object) was just a rela- 
belling of the elements using different symbols. The "algebra" is identical though. An automorphism is 
then a relabelling that is achieved merely by moving the elements of F around amongst themselves. So 
it is a way of picking the field up and placing it down upon itself so it looks like the same field: it is thus 
a symmetry of the field. 

Exercise 115 Show that if $\sigma$ is an automorphism of the field $F$ then $\sigma(0) = 0$ and $\sigma(1) = 1$.

(12.2) A familiar example of a field symmetry/automorphism is complex conjugation: the map $z \mapsto \bar{z}$ is an automorphism of $\mathbb{C}$, for, from elementary complex analysis we have,

$$\overline{z + w} = \bar{z} + \bar{w} \text{ and } \overline{zw} = \bar{z}\,\bar{w},$$

with conjugation a bijection $\mathbb{C} \to \mathbb{C}$. This symmetry captures the idea that from an algebraic point of view, we could have just as easily adjoined $-i$ to $\mathbb{R}$, rather than $i$, to obtain the complex numbers (they look the same upside down as right side up!).

We will see at the end of this section that if a non-trivial automorphism of $\mathbb{C}$ fixes pointwise the real numbers, then it must be complex conjugation. If we drop the requirement that $\mathbb{R}$ be fixed then there may be more possibilities: if we only insist that $\mathbb{Q}$ is fixed pointwise, there are infinitely many.

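(12.3) Every field $F$ has a prime subfield that is either $\mathbb{F}_p$ or $\mathbb{Q}$. Every element of the prime subfield has the form,

$$(\underbrace{1 + 1 + \cdots + 1}_{m \text{ times}})(\underbrace{1 + 1 + \cdots + 1}_{n \text{ times}})^{-1}.$$

If $\sigma$ is now an automorphism of $F$ we have

$$\sigma\big((1 + \cdots + 1)(1 + \cdots + 1)^{-1}\big) = \sigma(1 + \cdots + 1)\,\sigma\big((1 + \cdots + 1)^{-1}\big) = \big(\sigma(1) + \cdots + \sigma(1)\big)\big(\sigma(1) + \cdots + \sigma(1)\big)^{-1} = (1 + \cdots + 1)(1 + \cdots + 1)^{-1},$$

as $\sigma(1) = 1$. Thus the elements of the prime subfield are fixed pointwise by the automorphism.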

Exercise 116 We saw above that the map $a + bi \mapsto a - bi$ is an automorphism of $\mathbb{C}$. Show that $a + bi \mapsto -a + bi$ is not an automorphism of $\mathbb{C}$.

(12.4) Symmetries of things normally arrange themselves into a group, and field symmetries are no 
exception. We could talk just of the symmetry group of a field, but it turns out to be more instructive to 
make a slightly more elaborate definition that takes into consideration not just fields, but their extensions: 

Definition. Let $F \subset E$ be an extension of fields. The automorphisms of the field $E$ that fix pointwise the elements of $F$ form a group under composition called the Galois group of $E$ over $F$ and denoted $\mathrm{Gal}(E/F)$.
Thus an element $\sigma$ of $\mathrm{Gal}(E/F)$ has the property that $\sigma(\lambda) = \lambda$ for all $\lambda \in F$.

Exercise 117 For $F \subset E$ fields, show that the set of automorphisms $\mathrm{Gal}(E/F)$ of $E$ that fix $F$ pointwise do indeed form a group under composition.



(12.5) Consider as an example the field $\mathbb{Q}(\sqrt{2}, i)$. From the proof of the tower law, a basis for this field over $\mathbb{Q}$ is given by $\{1, \sqrt{2}, i, \sqrt{2}i\}$, so that the elements of the field are, by Theorem D,

$$\mathbb{Q}(\sqrt{2}, i) = \{a + b\sqrt{2} + ci + d\sqrt{2}i \mid a, b, c, d \in \mathbb{Q}\}.$$

Suppose we have a symmetry $\sigma \in \mathrm{Gal}(\mathbb{Q}(\sqrt{2}, i)/\mathbb{Q})$ and consider its effect on a typical element,

$$\sigma(a + b\sqrt{2} + ci + d\sqrt{2}i) = \sigma(a) + \sigma(b)\sigma(\sqrt{2}) + \sigma(c)\sigma(i) + \sigma(d)\sigma(\sqrt{2}i) = a + b\sigma(\sqrt{2}) + c\sigma(i) + d\sigma(\sqrt{2}i)$$

using the properties of automorphisms and the fact that $\sigma$ fixes rational numbers. Thus, the symmetry $\sigma$ is completely determined by its effect on the basis elements $\{1, \sqrt{2}, i, \sqrt{2}i\}$, in that once their images are decided, then $\sigma$ is uniquely known.

Aside. We can see that this is really no surprise. When we have fields $F \subset E$, then among other things, $E$ is a vector space over $F$. Given a symmetry $\sigma \in \mathrm{Gal}(E/F)$, then this is, among other things, a linear map of vector spaces $E \to E$, and we know from linear algebra that such things are completely determined by their effect on a basis.

Actually we can say even more. Clearly $\sigma(1) = 1$ is always true, and $\sigma(\sqrt{2}i) = \sigma(\sqrt{2})\sigma(i)$. Thus the symmetry $\sigma$ is completely determined by its effect on $\sqrt{2}$ and $i$, the elements adjoined to $\mathbb{Q}$.
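To make (12.5) concrete, the sketch below models an element $a + b\sqrt{2} + ci + d\sqrt{2}i$ as the tuple `(a, b, c, d)` of rational coordinates and multiplies using $(\sqrt{2})^2 = 2$ and $i^2 = -1$. As an assumption going slightly beyond the text, it takes the four natural candidates $\sqrt{2} \mapsto \pm\sqrt{2}$, $i \mapsto \pm i$ and checks on a few sample elements that each respects multiplication. This is a spot check, not a proof, and the names `mul` and `make_sigma` are illustrative.

```python
from fractions import Fraction
from itertools import product


def mul(x, y):
    """Multiply two elements given in the basis 1, sqrt2, i, sqrt2*i."""
    a, b, c, d = x
    e, f, g, h = y
    return (
        a * e + 2 * b * f - c * g - 2 * d * h,   # coefficient of 1
        a * f + b * e - c * h - d * g,           # coefficient of sqrt2
        a * g + c * e + 2 * b * h + 2 * d * f,   # coefficient of i
        a * h + d * e + b * g + c * f,           # coefficient of sqrt2*i
    )


def make_sigma(s, t):
    """Candidate symmetry with sqrt2 -> s*sqrt2 and i -> t*i, for s, t = +1 or -1."""
    return lambda x: (x[0], s * x[1], t * x[2], s * t * x[3])


samples = [
    (Fraction(1), Fraction(2), Fraction(-3), Fraction(1, 2)),
    (Fraction(0), Fraction(1), Fraction(1), Fraction(0)),
    (Fraction(5, 3), Fraction(0), Fraction(-1), Fraction(4)),
]

respects_products = {
    (s, t): all(
        make_sigma(s, t)(mul(x, y)) == mul(make_sigma(s, t)(x), make_sigma(s, t)(y))
        for x in samples
        for y in samples
    )
    for s, t in product((1, -1), repeat=2)
}
print(respects_products)
```

Each candidate is visibly additive and fixes the rationals, so multiplicativity is the only property that needs checking here.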

(12.6) And indeed this is a general fact. If $F \subset F(\alpha_1, \alpha_2, \ldots, \alpha_k) = E$ and $\sigma \in \mathrm{Gal}(E/F)$, then $\sigma$ is completely determined by its effect on the $\alpha_1, \ldots, \alpha_k$. For, suppose that $\{\beta_1, \ldots, \beta_n\}$ is a basis for $E$ over $F$, so that $\sigma$ is completely determined as above by its effect on the $\beta_i$. From the proof of the tower law, each $\beta_i$ is a product of the form,

$$\beta_i = \alpha_1^{i_1}\alpha_2^{i_2} \cdots \alpha_k^{i_k},$$

and so $\sigma(\beta_i) = \sigma(\alpha_1)^{i_1}\sigma(\alpha_2)^{i_2} \cdots \sigma(\alpha_k)^{i_k}$. Thus $\sigma(\beta_i)$ in turn is determined by the $\sigma(\alpha_j)$.

(12.7) The structure of Galois groups can sometimes be determined via ad-hoc arguments, at least in very 
simple cases. 

Let $\omega$ be the primitive cube root of 1,

$$\omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}i,$$

and consider the extension $\mathbb{Q} \subset \mathbb{Q}(\omega)$. Although the most obvious polynomial that $\omega$ is a root of is $x^3 - 1$, this is reducible, so the minimum polynomial of $\omega$ over $\mathbb{Q}$ is in fact $x^2 + x + 1$ (see Exercise 29 where we showed $1 + x + x^2 + \cdots + x^{p-1}$ to be irreducible over $\mathbb{Q}$ for $p$ a prime). Thus by Theorem D, $\mathbb{Q}(\omega) = \{a + b\omega \mid a, b \in \mathbb{Q}\}$, giving that $\mathbb{Q}(\omega)$ is 2-dimensional over $\mathbb{Q}$ with basis $\{1, \omega\}$. Suppose that $\sigma \in \mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})$, whose effect we now know is completely determined by where it sends $\omega$. Suppose $\sigma(\omega) = a + b\omega$ for some $a, b \in \mathbb{Q}$ to be determined. On the one hand we have $\sigma(\omega^3) = \sigma(1) = 1$, while on the other,

$$\sigma(\omega^3) = \sigma(\omega)^3 = (a + b\omega)^3 = (a^3 + b^3 - 3ab^2) + (3a^2b - 3ab^2)\omega.$$

The last bit uses the fact that $\omega^2 = -\omega - 1$.

One of the consequences of $\{1, \omega\}$ being a basis for $\mathbb{Q}(\omega)$ over $\mathbb{Q}$ is that elements have unique expressions as linear combinations of these two basis elements (this is a consequence of linear independence). This means that given two expressions for an element as linear combinations of 1 and $\omega$, we can "equate the 1 and $\omega$ parts$^{12}$". Thus,

$$1 = \sigma(\omega^3) = (a^3 + b^3 - 3ab^2) + (3a^2b - 3ab^2)\omega, \text{ so that } a^3 + b^3 - 3ab^2 = 1 \text{ and } 3a^2b - 3ab^2 = 0.$$

Solving these last two equations (in $\mathbb{Q}$!) gives three solutions $a = 0, b = 1$; $a = 1, b = 0$ and $a = -1, b = -1$, corresponding to $\sigma(\omega) = \omega$ and $\sigma(\omega) = -1 - \omega = \omega^2$ (the middle solution gives $\sigma(\omega) = 1$ which is impossible as $\sigma$ is a bijection and we already have that $\sigma(1) = 1$). Thus the Galois group $\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q}) = \{id, \sigma\}$ has order two.

12 Just as we equate real and imaginary parts of complex numbers, and for the same reason: $\{1, i\}$ is a basis for $\mathbb{C}$ over $\mathbb{R}$. On the other hand, we could not do this for two expressions in terms of 1, $\omega$ and $\omega^2$.
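The expansion of $(a + b\omega)^3$ and the list of solutions can be double-checked with exact arithmetic. The sketch below represents $a + b\omega$ as the pair `(a, b)` and multiplies using $\omega^2 = -\omega - 1$. As a simplifying assumption it only searches integer pairs in a small box, so it confirms the three solutions found above rather than proving there are no other rational ones; the names `mul` and `cube` are illustrative.

```python
def mul(x, y):
    """(a + b*w)(c + d*w) in the basis {1, w}, using w^2 = -w - 1."""
    a, b = x
    c, d = y
    return (a * c - b * d, a * d + b * c - b * d)


def cube(x):
    return mul(mul(x, x), x)


grid = [(a, b) for a in range(-3, 4) for b in range(-3, 4)]

# Check the closed formula for (a + b*w)^3 quoted in (12.7).
formula_ok = all(
    cube((a, b)) == (a ** 3 + b ** 3 - 3 * a * b ** 2, 3 * a ** 2 * b - 3 * a * b ** 2)
    for a, b in grid
)

# Pairs (a, b) in the box with (a + b*w)^3 = 1.
cube_roots_of_one = [x for x in grid if cube(x) == (1, 0)]
print(formula_ok, cube_roots_of_one)
```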

(12.8) Our first major tool for unpicking the structure of Galois groups is, 

Theorem F (The Extension Theorem). Let $F_1, F_2$ be fields and $\tau : F_1 \to F_2$ an isomorphism between them with $\tau^* : F_1[x] \to F_2[x]$ the resulting homomorphism of rings given by $\tau^*(\sum a_i x^i) = \sum \tau(a_i) x^i$. If $\alpha$ is algebraic over $F_1$, then the isomorphism $\tau$ extends to an isomorphism $\sigma : F_1(\alpha) \to F_2(\beta)$ with $\sigma(\alpha) = \beta$ if and only if $\beta$ is a root of $\tau^*(f)$, where $f$ is the minimum polynomial of $\alpha$ over $F_1$.

The elements $\alpha$ and $\beta$ are assumed to lie in some extensions $F_i \subset E_i$ of the two fields, and when we say that $\tau$ extends to $\sigma$ we mean that $\sigma|_{F_1} = \tau$.

[Figure: the fields $F(\alpha)$ and $F(\beta)$ sitting over $F$, with the extension $\sigma$ between them lying over the identity on $F$.]

The theorem is quite technical, but nevertheless has an intuitive meaning. Suppose we have the special case where $F_1 = F_2 = F$ and $\tau$ is the identity isomorphism (hence $\tau^*$ is also the identity map). Then we have an extension $\sigma : F(\alpha) \to F(\beta)$ precisely when $\beta$ is a root of the minimum polynomial $f$ of $\alpha$ over $F$. Indeed we can say even more: if $\beta$ is an element of $F(\alpha)$, then $F(\beta) \subset F(\alpha)$ is an $F$-vector subspace, but since $f$ must also be the minimum polynomial of $\beta$, $F(\beta)$ is $(\deg f)$-dimensional over $F$, just like $F(\alpha)$, and so $F(\beta) = F(\alpha)$. Thus $\sigma$ is an automorphism of $F(\alpha)$ fixing $F$ pointwise, and so an element of the Galois group $\mathrm{Gal}(F(\alpha)/F)$. Summarising everything we know about Galois groups so far,

Corollary. Let $\alpha$ be algebraic over $F$ with minimum polynomial $f$ over $F$. A map $\sigma : F(\alpha) \to F(\alpha)$ is an element of the Galois group $\mathrm{Gal}(F(\alpha)/F)$ if and only if for any $\sum_{k=0}^{\deg f - 1} a_k\alpha^k \in F(\alpha)$ we have,

$$\sigma\Big(\sum_{k=0}^{\deg f - 1} a_k\alpha^k\Big) = \sum_{k=0}^{\deg f - 1} a_k\beta^k,$$

where $\beta$ is also a root of $f$ contained in $F(\alpha)$.

Thus the elements of the Galois group permute the roots of the minimum polynomial that are contained in $F(\alpha)$ amongst themselves.

Proof of the Extension Theorem: We give a "grungy" proof that nevertheless makes the situation nice and concrete. For the only if part, we have that if $f = \sum a_i x^i$ with $f(\alpha) = 0$, then $\sum a_i\alpha^i = 0$ in $E_1$ so that

$$\sigma\Big(\sum a_i\alpha^i\Big) = 0 \Rightarrow \sum \sigma(a_i)\sigma(\alpha)^i = 0 \Rightarrow \sum \tau(a_i)\beta^i = 0 \text{ in } E_2,$$

giving that $\beta$ is a root of $\tau^*(f)$ as claimed.

For the if part, we need to define an isomorphism with the desired properties. The elements of $F_1(\alpha)$ all have the form $\sum_{i=0}^{d-1} a_i\alpha^i$, where $d = \deg f$. Define $\sigma$ by

$$\sigma\Big(\sum_{i=0}^{m} a_i\alpha^i\Big) = \sum_{i=0}^{m} \tau(a_i)\beta^i, \qquad (4)$$

for any $m$. From this definition we see that $\sigma(a) = \tau(a)$ for any $a \in F_1$ and also that $\sigma(\alpha) = \beta$. The proof then proceeds in three parts.

$\sigma$ is well-defined and 1-1: Suppose we have two expressions

$$\sum a_i\alpha^i = \sum b_i\alpha^i$$

representing the same element of $F_1(\alpha)$. Thus $\sum (a_i - b_i)\alpha^i = 0$ giving that $\alpha$ is a root of the polynomial $g = \sum (a_i - b_i)x^i \in F_1[x]$. As $f$ is the minimum polynomial of $\alpha$ over $F_1$ we must have that $f$ is a factor of $g$, and so,

$$g = fh \Leftrightarrow \tau^*(g) = \tau^*(fh) = \tau^*(f)\tau^*(h).$$

Hence $\tau^*(f)$ is a factor of $\tau^*(g)$. As $\beta$ is a root of $\tau^*(f)$ it must then also be a root of $\tau^*(g)$, ie:

$$\tau^*(g)(\beta) = 0 \Leftrightarrow \sum \tau(a_i - b_i)\beta^i = 0 \Leftrightarrow \sum \tau(a_i)\beta^i = \sum \tau(b_i)\beta^i \Leftrightarrow \sigma\Big(\sum a_i\alpha^i\Big) = \sigma\Big(\sum b_i\alpha^i\Big).$$

Thus the two expressions for the same element are sent to the same element of $F_2(\beta)$, giving that $\sigma$ is well-defined. This is the "only if" part of all the equivalences above, with the "if" part giving that $\sigma$ is 1-1.

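$\sigma$ is a homomorphism: we need to show that $\sigma$ respects the addition and multiplication in the field $F_1(\alpha)$. Let

$$\lambda = \sum_{i=0}^{n} a_i\alpha^i \quad\text{and}\quad \mu = \sum_{i=0}^{m} b_i\alpha^i,$$

be two elements, where we set $a_i = 0$ for $i > n$ and $b_i = 0$ for $i > m$. Then

$$\sigma(\lambda + \mu) = \sigma\Big(\sum_{i=0}^{\max\{m,n\}} (a_i + b_i)\alpha^i\Big) = \sum_{i=0}^{\max\{m,n\}} \tau(a_i + b_i)\beta^i = \sum_{i=0}^{n} \tau(a_i)\beta^i + \sum_{i=0}^{m} \tau(b_i)\beta^i = \sigma(\lambda) + \sigma(\mu).$$

Similarly,

$$\sigma(\lambda\mu) = \sigma\Big(\sum_{k=0}^{m+n} \Big(\sum_{i+j=k} a_i b_j\Big)\alpha^k\Big) = \sum_{k=0}^{m+n} \tau\Big(\sum_{i+j=k} a_i b_j\Big)\beta^k = \sum_{k=0}^{m+n} \Big(\sum_{i+j=k} \tau(a_i)\tau(b_j)\Big)\beta^k = \Big(\sum_{i=0}^{n} \tau(a_i)\beta^i\Big)\Big(\sum_{j=0}^{m} \tau(b_j)\beta^j\Big) = \sigma(\lambda)\sigma(\mu).$$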

One comment: in both cases we had $\sigma$ of an expression, and we replaced this by the definition given at (4). Certainly in the case of multiplication, the expression was quite possibly not a polynomial in $\alpha$ of degree $< d$. If we had defined $\sigma$ for just these expressions we wouldn't have been able to use (4) as it stands. Thus we defined $\sigma$ for any expression, but this then requires that we show the definition to be well-defined, for an element has many different expressions as polynomials in $\alpha$ if we relax the condition that these expressions have degree $< d$.

$\sigma$ is onto: Certainly we have that $\sigma(F_1(\alpha))$ is contained in $F_2(\beta)$ by the definition at (4), as the right hand side is contained in $F_2(\beta)$. On the other hand, any $\mu \in F_2$ arises as the image $\tau(\lambda)$ for some $\lambda \in F_1$, as $\tau$ is onto. Also $\beta = \sigma(\alpha)$ by definition, so $F_2, \beta \in \sigma(F_1(\alpha))$, hence $F_2(\beta) \subseteq \sigma(F_1(\alpha))$, giving that the image $\sigma(F_1(\alpha))$ is $F_2(\beta)$. □

(12.9) If we compute instead the Galois group of the extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$, we have the freedom to send $\sqrt[3]{2}$ to those roots of its minimum polynomial over $\mathbb{Q}$ that are also contained in the field $\mathbb{Q}(\sqrt[3]{2})$. This minimum polynomial is $x^3 - 2$ which has the roots $\alpha$, $\alpha\omega$ and $\alpha\omega^2$ for $\alpha = \sqrt[3]{2}$ and

$$\omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}\,i.$$

But now the roots $\alpha\omega$ and $\alpha\omega^2$ are not contained in $\mathbb{Q}(\sqrt[3]{2})$ as this field contains only real numbers while these roots are clearly non-real. Thus the only possible image for $\alpha$ is $\alpha$ itself, giving that $\mathrm{Gal}(\mathbb{Q}(\alpha)/\mathbb{Q})$ is just the trivial group.
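The claim in (12.9) that only one root of $x^3 - 2$ is real can be checked numerically. The following is a minimal sketch in floating point, so it is an illustration rather than a proof; the names `roots`, `is_real` and the tolerance `1e-9` are our own choices and not part of the notes.

```python
import cmath

# alpha is the real cube root of 2; omega is a primitive cube root of 1.
alpha = 2 ** (1 / 3)
omega = cmath.exp(2j * cmath.pi / 3)

# The three roots of x^3 - 2 are alpha, alpha*omega and alpha*omega^2.
roots = [alpha * omega ** k for k in range(3)]

def is_real(z, tol=1e-9):
    """Treat z as real when its imaginary part vanishes up to rounding."""
    return abs(z.imag) < tol

# Each root satisfies x^3 - 2 = 0 up to rounding error.
residuals = [abs(r ** 3 - 2) for r in roots]

# Q(alpha) sits inside the real numbers, so only real roots can lie in it.
real_roots = [r for r in roots if is_real(r)]
```

Only the root $\alpha$ itself survives the filter, which is the numerical shadow of the Galois group being trivial.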



(12.10) Returning to the example we calculated in an ad-hoc fashion immediately before the extension theorem, any automorphism of $\mathbb{Q}(\omega)$ that fixes $\mathbb{Q}$ pointwise is determined by where it sends $\omega$, and this must be to a root of the minimum polynomial over $\mathbb{Q}$ of $\omega$. As this polynomial is $1 + x + x^2$ with roots $\omega$ and $\omega^2$, we get that the possible automorphisms send $\omega$ to itself or to $\omega^2$, ie:

$$\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q}) = \{\mathrm{id}, \sigma\},$$

where $\sigma(a + b\omega) = a + b\omega^2$ is in fact just complex conjugation. This answers Exercise 3, in showing that the left hand figure depicts an automorphism, but the right hand figure does not:

[Figure: the two diagrams of Exercise 3.]

(12.11) The only if part of the proof of the Extension Theorem is usefully stated as a separate,

Corollary. If $g$ is a polynomial with $F$-coefficients and with a root $\alpha \in E$, then for any $\sigma \in \mathrm{Gal}(E/F)$, the image $\sigma(\alpha)$ is also a root of $g$.

An immediate and important consequence is,

Corollary. If $F \subseteq E$ is a finite extension then the Galois group $\mathrm{Gal}(E/F)$ is finite.

Proof: If $\{\alpha_1, \alpha_2, \dots, \alpha_k\}$ is a basis for $E$ over $F$, then we have $E = F(\alpha_1, \alpha_2, \dots, \alpha_k)$, and by Proposition 3, each of the $\alpha_i$ is algebraic over $F$ with some minimum polynomial $f_i \in F[x]$. If $\sigma$ is an element of the Galois group, then $\sigma$ is completely determined by where it sends each $\alpha_i$, for which there are only finitely many possibilities: to the roots of $f_i$. □



(12.12) Let $p$ be a prime and let

$$\omega = \cos\frac{2\pi}{p} + i\sin\frac{2\pi}{p},$$

be a root of 1, where $\omega^p = 1$. By the Extension Theorem we have an element of the Galois group $\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})$ precisely when it sends $\omega$ to some root contained in $\mathbb{Q}(\omega)$ of its minimum polynomial over $\mathbb{Q}$. The minimum polynomial is the $p$-th cyclotomic polynomial,

$$\Phi_p = 1 + x + x^2 + \cdots + x^{p-1},$$

as we saw in Exercise 29, with roots the other roots of 1 (except for 1 itself) namely $\omega, \omega^2, \dots, \omega^{p-1}$. Clearly all these roots are contained in $\mathbb{Q}(\omega)$, and so we are free to send $\omega$ to any one of them. Thus, the Galois group has order $p - 1$, with an element corresponding to each of the possible images of $\omega$. If $\sigma(\omega) = \omega^k$ then $\sigma^i(\omega) = \omega^{k^i}$.

We saw in §9. that the multiplicative group of the finite field $\mathbb{F}_p$ is cyclic. In other words, there is a $k$ with $1 < k < p$, such that the powers $k^i$ of $k$ exhaust all of the non-zero elements of $\mathbb{F}_p$, ie: the powers $k^i$ run through $\{1, 2, \dots, p-1\}$ modulo $p$, or $k$ generates $\mathbb{F}_p^*$.

Putting the previous two paragraphs together, if we take a $\sigma$ in the Galois group with $\sigma(\omega) = \omega^k$ for $k$ a generator of $\mathbb{F}_p^*$, then the elements,

$$\sigma(\omega), \sigma^2(\omega), \dots, \sigma^{p-1}(\omega),$$

run through the roots $\{\omega, \omega^2, \dots, \omega^{p-1}\}$. Thus the elements $\sigma, \sigma^2, \dots, \sigma^{p-1}$ exhaust the Galois group, and so the Galois group of the extension $\mathbb{Q} \subseteq \mathbb{Q}(\omega)$ is cyclic. This (I imagine) is the reason behind the term cyclotomic.
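The argument of (12.12) is easy to try out for a particular prime. The sketch below tracks only the exponents: an automorphism with $\sigma(\omega) = \omega^k$ is recorded by $k$ modulo $p$, and $\sigma^i$ by $k^i$ modulo $p$. The function names and the choice $p = 7$ are ours, made purely for illustration.

```python
def primitive_root(p):
    """Return the smallest generator k of the multiplicative group of F_p (p an odd prime)."""
    for k in range(2, p):
        powers = {pow(k, i, p) for i in range(1, p)}
        if len(powers) == p - 1:
            return k
    raise ValueError("no generator found; is p an odd prime?")

def orbit_exponents(p, k):
    """Exponents e_i with sigma^i(omega) = omega^(e_i), where sigma(omega) = omega^k."""
    return [pow(k, i, p) for i in range(1, p)]

p = 7
k = primitive_root(p)
exponents = orbit_exponents(p, k)
```

For $p = 7$ the smallest generator is $k = 3$, and the exponents $3, 2, 6, 4, 5, 1$ visit every non-zero residue once, so the powers of this single $\sigma$ give all six elements of the Galois group.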
(12.13) The extension theorem gives the existence of extended automorphisms, but also indicates the number of such extensions: there is one for each root of $\tau^*(f)$ contained in $E_2$. Making this more precise:

Theorem G. Let $\tau : F_1 \to F_2$ be an isomorphism and $F_1 \subseteq E_1$ and $F_2 \subseteq E_2$ be extensions with $E_1$ a splitting field of some polynomial $f$ over $F_1$ and $E_2$ a splitting field of $\tau^*(f)$ over $F_2$. Assume that the roots of $\tau^*(f)$ in $E_2$ are distinct. Then the number of extensions of $\tau$ to an isomorphism $\sigma : E_1 \to E_2$ is equal to the degree of the extension $F_2 \subseteq E_2$.

Proof: The proof proceeds by induction and the Extension Theorem.

[Diagram: $\tau : F_1 \to F_2$ along the bottom and $\tau' : F_1(\alpha) \to F_2(\beta)$ above it, sitting inside $E_1$ and $E_2$.]

We have that $E_1 = F_1(\alpha_1, \dots, \alpha_{\deg f})$ as it is a splitting field for $f$ over $F_1$. Let $\alpha = \alpha_1$ and consider the extension $F_1 \subseteq F_1(\alpha)$ and the picture at left where $\beta$ is some element of $E_2$. We have by the Extension Theorem that $\tau$ extends to an isomorphism $\tau' : F_1(\alpha) \to F_2(\beta)$ if and only if $\beta$ is a root in $E_2$ of $\tau^*(p)$, where $p$ is the minimum polynomial of $\alpha$ over $F_1$. In particular, $F_2(\beta)$ is isomorphic as a vector space over $F_2$ to the vector space $F_1(\alpha)$ over $F_1$ ($\cong F_2$), and so they must have the same dimension. Thus any polynomial that has $\beta$ as a root has, by Theorem D, degree at least that of $p$, so that $\tau^*(p)$ does. On the other hand we always have $\deg \tau^*(p) \le \deg p$. Thus $\tau^*(p)$ must be the minimum polynomial of $\beta$ over $F_2$.

As $\alpha$ is a root of $f$ we have that $p$ divides $f$, ie: $f = ph$ in $F_1[x]$ so that $\tau^*(f) = \tau^*(p)\tau^*(h)$ in $F_2[x]$, giving that $\tau^*(p)$ divides $\tau^*(f)$. As the roots of $\tau^*(f)$ are distinct, those of $\tau^*(p)$ must be too.

Thus the number of possible extensions $\tau'$, which is equal to the number of distinct roots of $\tau^*(p)$, must in fact be equal to the degree of $\tau^*(p)$, which is in turn the degree of the extension $[F_2(\beta) : F_2] > 1$. By the tower law,

$$[E_2 : F_2] = [E_2 : F_2(\beta)]\,[F_2(\beta) : F_2],$$

and by induction, any isomorphism $\tau' : F_1(\alpha) \to F_2(\beta)$ will have

$$[E_2 : F_2(\beta)] = \frac{[E_2 : F_2]}{[F_2(\beta) : F_2]}$$

extensions to an isomorphism $\sigma : E_1 \to E_2$. Finally then, starting from the very bottom, $\tau$ extends to $[F_2(\beta) : F_2]$ possible $\tau'$s, and extending each in turn gives,

$$[F_2(\beta) : F_2] \cdot \frac{[E_2 : F_2]}{[F_2(\beta) : F_2]} = [E_2 : F_2]$$

extensions in total. □

The reason that the roots of $\tau^*(f)$ need to be distinct is that we can then relate the number of automorphisms to degrees of extensions by passing through the midway house of the roots of polynomials. If the polynomial has repeated roots then the number of automorphisms would be less than the degree of the extension and so the set-up is less conveniently described.

Thus the requirement in the Theorem, and later in the notes, that the roots be distinct is not essential to the theory per se, but allows the theorems to be stated in a nice way.

(12.14) To summarise where we are at, Theorem D gives a connection between the degrees of field extensions and minimum polynomials, while the Extension Theorem and Theorem G connect minimum polynomials with the number of automorphisms of a field. Perhaps the following theorem is not then so surprising:

Corollary. Let $f$ be a polynomial over $F$ and $E = F(\alpha_1, \dots, \alpha_m)$ its splitting field over $F$ with the roots $\alpha_i$ of $f$ distinct. Then

$$|\mathrm{Gal}(E/F)| = [E : F].$$

The polynomial $f$ is contained in the ring $F[x]$ over the field $F$, with $E$ a vector space over $F$ and $\mathrm{Gal}(E/F)$ the group of automorphisms. In this concise statement appear all the major algebraic objects of undergraduate mathematics.

Proof: The result follows immediately from Theorem G by letting $F_1 = F_2 = F$, $E_1 = E_2 = E$ and $\tau$ be the identity automorphism. This gives that there are $[E : F]$ extensions of the identity automorphism $F \to F$ to automorphisms of $E$. On the other hand, any automorphism of $E$ fixing $F$ pointwise is an extension of the identity automorphism on $F$, and so we obtain all the elements of the Galois group this way. □

(12.15) The criterion that $E$ be a splitting field is important in using Theorem G and its Corollary properly. If you consider the extension $\mathbb{Q} \subseteq \mathbb{Q}(\sqrt[3]{2})$, then $\sigma$ is an element of the Galois group $\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q})$ precisely when it sends $\sqrt[3]{2}$ to a root contained in $\mathbb{Q}(\sqrt[3]{2})$ of its minimum polynomial over $\mathbb{Q}$. As these roots are $\sqrt[3]{2}$ itself and two others that are non-real, and the field $\mathbb{Q}(\sqrt[3]{2})$ is completely contained in $\mathbb{R}$, the only automorphism we can have is the one that sends $\sqrt[3]{2}$ to itself, ie: the identity automorphism. Thus the Galois group has order 1, whereas the degree of the extension is 3.



(12.16) Theorem 8 Let $f$ be a polynomial over $F$ and $E = F(\alpha_1, \dots, \alpha_m)$ its splitting field over $F$ with the roots $\alpha_i$ of $f$ distinct. Moreover, suppose that

$$[E : F] = \prod_i [F(\alpha_i) : F].$$

Then there is a $\sigma \in \mathrm{Gal}(E/F)$ with $\sigma(\alpha_i) = \beta_i$ if and only if $\beta_i$ is a root of the minimum polynomial over $F$ of $\alpha_i$.

Proof: That it is necessary for $\beta_i$ to be a root of $f_i$, the minimum polynomial of $\alpha_i$ over $F$, has already been established. On the other hand, the condition on the degree of the extension means that the order of the Galois group $\mathrm{Gal}(E/F)$ is equal to the product of the degrees of the $f_i$, and so for the correct number of automorphisms to be realised, it must be possible to send $\alpha_i$ to any root of its minimum polynomial $f_i$. □

(12.17) In the first lecture we looked at the automorphisms of $\mathbb{Q}(\alpha, \omega)$ for

$$\alpha = \sqrt[3]{2} \quad\text{and}\quad \omega = -\frac{1}{2} + \frac{\sqrt{3}}{2}\,i,$$

which in our new language translates as finding the elements of the Galois group $\mathrm{Gal}(\mathbb{Q}(\alpha, \omega)/\mathbb{Q})$. The minimum polynomial of $\alpha$ over $\mathbb{Q}$ is $x^3 - 2$ with roots $\alpha, \alpha\omega, \alpha\omega^2$ and the minimum polynomial of $\omega$ over $\mathbb{Q}$ is $1 + x + x^2$ with roots $\omega, \omega^2$. The Tower law then gives that

$$[\mathbb{Q}(\alpha,\omega) : \mathbb{Q}] = [\mathbb{Q}(\alpha,\omega) : \mathbb{Q}(\alpha)]\,[\mathbb{Q}(\alpha) : \mathbb{Q}] = [\mathbb{Q}(\omega) : \mathbb{Q}]\,[\mathbb{Q}(\alpha) : \mathbb{Q}].$$

Thus by the Theorem above, we may send $\alpha$ to any one of $\alpha, \alpha\omega, \alpha\omega^2$ and $\omega$ to any one of $\omega, \omega^2$ and get an automorphism. This gives six possible automorphisms, agreeing with the six we found in Lecture 1, one for each symmetry of the equilateral triangle formed by the roots in $\mathbb{C}$. Following this through with the vertices of the triangle, we have three automorphisms with $\omega$ mapped to itself:

[Figure: the automorphisms acting on the triangle of roots.]
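The six automorphisms of (12.17) can be listed by their effect on the roots of $x^3 - 2$. In this sketch a pair `(a, b)` is our own shorthand for the automorphism with $\alpha \mapsto \alpha\omega^a$ and $\omega \mapsto \omega^b$, and the root $\alpha\omega^j$ is recorded by its index $j$; none of this notation comes from the notes.

```python
from itertools import product

def induced_permutation(a, b):
    """Permutation of the root indices (0, 1, 2), where index j stands for alpha*omega^j.

    The automorphism sends alpha -> alpha*omega^a and omega -> omega^b,
    so alpha*omega^j -> alpha*omega^(a + b*j), with exponents taken mod 3.
    """
    return tuple((a + b * j) % 3 for j in range(3))

# a in {0, 1, 2} picks the image of alpha; b in {1, 2} picks the image of omega.
automorphisms = {(a, b): induced_permutation(a, b)
                 for a, b in product(range(3), (1, 2))}
distinct_permutations = set(automorphisms.values())

# The automorphisms with omega mapped to itself act as rotations of the triangle.
rotations = [perm for (a, b), perm in automorphisms.items() if b == 1]
```

The six pairs give six different permutations of the three roots, so every symmetry of the triangle is realised, and the three with `b == 1` are the rotations.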
Exercise 118 Let $\alpha = \sqrt[5]{2}$ and $\omega = \cos(2\pi/5) + i\sin(2\pi/5)$ (so that $\alpha^5 = 2$ and $\omega^5 = 1$). Letting $\beta = \alpha + \omega$, eliminate radicals by considering the expression $(\beta - \omega)^5 = 2$ and find a polynomial of degree 20 having $\beta$ as a root. Show that this polynomial is irreducible over $\mathbb{Q}$ and hence that

$$[\mathbb{Q}(\alpha + \omega) : \mathbb{Q}] = [\mathbb{Q}(\alpha) : \mathbb{Q}]\,[\mathbb{Q}(\omega) : \mathbb{Q}].$$

Finally, show that $\mathbb{Q}(\alpha + \omega) = \mathbb{Q}(\alpha, \omega)$.



(12.18) Returning to some of the other examples from the first lecture, the extension $\mathbb{Q} \subseteq \mathbb{Q}(\alpha, \omega)$ satisfies the criterion of the Theorem above, where $\alpha = \sqrt[5]{2}$ and $\omega$ is a primitive 5-th root of 1. Thus an automorphism is free to send $\alpha$ to any root of the polynomial $x^5 - 2$ and $\omega$ to any root of the 5-th cyclotomic polynomial $1 + x + x^2 + x^3 + x^4$. Thus there are twenty elements of the Galois group in total.

[Figure: the roots of $x^5 - 2$ in the complex plane.]

In particular we have the automorphism that sends $\alpha$ to itself and $\omega$ to $\omega^3$ as depicted in the picture.
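The count of twenty in (12.18) can be turned into a small computation. As a sketch, and with notation that is ours rather than the notes', write `(a, b)` for the automorphism with $\alpha \mapsto \alpha\omega^a$ and $\omega \mapsto \omega^b$; the Theorem above is what guarantees that every such pair really is an automorphism.

```python
from itertools import product

P = 5  # alpha^5 = 2 and omega^5 = 1

def compose(s, t):
    """Return the pair for 'apply t first, then s'.

    A pair (a, b) stands for alpha -> alpha*omega^a, omega -> omega^b.
    """
    a, b = s
    c, d = t
    # s(t(alpha)) = s(alpha*omega^c) = alpha*omega^(a + b*c); s(t(omega)) = omega^(b*d)
    return ((a + b * c) % P, (b * d) % P)

# a in {0,...,4} picks the image of alpha; b in {1,...,4} picks the image of omega.
elements = list(product(range(P), range(1, P)))

closed = all(compose(s, t) in elements for s in elements for t in elements)
abelian = all(compose(s, t) == compose(t, s) for s in elements for t in elements)

# The automorphism singled out in the text: alpha -> alpha, omega -> omega^3.
fix_alpha = (0, 3)
```

The twenty pairs are closed under composition and do not all commute, and repeatedly composing `fix_alpha` with itself returns to the identity `(0, 1)` after four steps.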



(12.19) So far we have only considered the Galois groups of fields, but if we are to get closer to the spirit of the first lecture, then we should be more interested in the Galois groups of polynomials. In the first lecture we achieved this using the smallest field containing the roots of the polynomial, and indeed we define: the Galois group of the polynomial $f \in F[x]$ is the Galois group $\mathrm{Gal}(E/F)$ of the splitting field $E$ of $f$. Denote the group by $\mathrm{Gal}(f)$.

Proposition 4 The Galois group of a polynomial of degree $d$ is isomorphic to a subgroup of the symmetric group $S_d$.

Proof: If $\{\alpha_1, \alpha_2, \dots, \alpha_d\}$ are the roots of $f$, then $\mathrm{Gal}(f) = \mathrm{Gal}(E/F)$ where the splitting field $E$ is given by $E = F(\alpha_1, \alpha_2, \dots, \alpha_d)$. Any element of $\mathrm{Gal}(f)$ is determined by where it sends each $\alpha_i$, which must be to some root of its minimum polynomial over $F$. For any $i$, this minimum polynomial divides $f$ (recall that the minimum polynomial of $\alpha$ divides any polynomial having $\alpha$ as a root) so its roots are contained amongst the roots of $f$, ie: amongst the $\{\alpha_1, \alpha_2, \dots, \alpha_d\}$. Thus, any element of the Galois group can be identified with a permutation of these $d$ roots. Different automorphisms correspond to different permutations, as the effect of the automorphism on the roots determines the whole automorphism. Thus $\mathrm{Gal}(f)$ may be identified with a subgroup of the group of permutations of the $d$ roots, which is clearly isomorphic to $S_d$. □

Aside. There is a slick algebraic (some would say proper) way to put this, although it loses a little of the intuitive nature of what is going on. As any element of the Galois group defines a permutation of the roots, define a map $\mathrm{Gal}(E/F) \to \mathrm{Sym}\{\alpha_1, \alpha_2, \dots, \alpha_d\}$ by sending a $\sigma \in \mathrm{Gal}(E/F)$ to this permutation. As the group operation is composition of maps in both these groups, we get that this is a homomorphism. If $\sigma \in \mathrm{Gal}(E/F)$ is sent to the identity permutation, then as an automorphism it fixes all the roots, so must be the identity automorphism, ie: the kernel of the homomorphism is trivial. The first isomorphism theorem for groups then gives that $\mathrm{Gal}(E/F)/\{\mathrm{id}\} \cong \mathrm{Gal}(E/F) \cong H$, a subgroup of $\mathrm{Sym}\{\alpha_1, \alpha_2, \dots, \alpha_d\} \cong S_d$.
Further Exercises for §12.

Exercise 119 Show that the following Galois groups have the given orders:

1. $|\mathrm{Gal}(\mathbb{Q}(\sqrt{2})/\mathbb{Q})| = 2$.

2. $|\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q})| = 1$.

3. $|\mathrm{Gal}(\mathbb{Q}(-\frac{1}{2} + \frac{\sqrt{3}}{2}i)/\mathbb{Q})| = 2$.

4. $|\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2}, -\frac{1}{2} + \frac{\sqrt{3}}{2}i)/\mathbb{Q})| = 6$.

Exercise 120 Find the orders of the following Galois groups:

1. $\mathrm{Gal}(L/\mathbb{Q})$, where $L$ is the splitting field of the polynomial $x - 2$.

2. $\mathrm{Gal}(L/\mathbb{Q})$, where $L$ is the splitting field of the polynomial $x^2 - 2$.

3. $\mathrm{Gal}(L/\mathbb{Q})$, where $L$ is the splitting field of the polynomial $x^5 - 2$.

4. $\mathrm{Gal}(L/\mathbb{Q})$, where $L$ is the splitting field of the polynomial $1 + x + x^2 + x^3 + x^4$.

5. $\mathrm{Gal}(L/\mathbb{Q})$, where $L$ is the splitting field of the polynomial $1 + x^2 + x^4$ (hint: $(x^2 - 1)(1 + x^2 + x^4) = x^6 - 1$).

Exercise 121 Let $p > 2$ be a prime number. Show that

1. $|\mathrm{Gal}(\mathbb{Q}(\cos\frac{2\pi}{p} + i\sin\frac{2\pi}{p})/\mathbb{Q})| = p - 1$.

2. $|\mathrm{Gal}(L/\mathbb{Q})| = p(p-1)$, where $L$ is the splitting field of the polynomial $x^p - 2$. Compare the answer when $p = 3$ and $5$ to Lecture §1.

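Exercise 122 Let $p_1, \dots, p_m$ be distinct primes. Show that,

$$\mathrm{Gal}(\mathbb{Q}(\sqrt{p_1}, \dots, \sqrt{p_m})/\mathbb{Q}) \cong \underbrace{\mathbb{Z}_2 \times \cdots \times \mathbb{Z}_2}_{m \text{ times}}.$$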
§13. Linear Algebra II: Solving equations

This section exists purely to provide some of the technical results we need for the big theorem of the next. It can be skipped over on a first reading.

(13.1) Let $V$ be an $n$-dimensional vector space over the field $F$, with fixed basis $\{\alpha_1, \alpha_2, \dots, \alpha_n\}$. A homogeneous linear equation over $F$ is an equation of the form,

$$a_1 x_1 + a_2 x_2 + \cdots + a_n x_n = 0,$$

with the $a_i$ in $F$. The vector $v \in V$ is a solution if

$$v = \sum_{i=1}^{n} t_i\alpha_i \quad\text{where}\quad a_1 t_1 + a_2 t_2 + \cdots + a_n t_n = 0.$$

Call a system of linear equations,

$$a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = 0,$$
$$a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = 0,$$
$$\vdots$$
$$a_{k1}x_1 + a_{k2}x_2 + \cdots + a_{kn}x_n = 0,$$

independent over $F$ iff the collection of vectors,

$$v_1 = \sum_j a_{1j}\alpha_j,\; v_2 = \sum_j a_{2j}\alpha_j,\; \dots,\; v_k = \sum_j a_{kj}\alpha_j,$$

in $V$ are independent.
Exercise 123

1. Let $S$ be an independent system of equations in $n$ unknowns. Show that $S$ has the unique solution $v = 0$ in $V$.

2. Let $S$ be a system of independent equations in $V$ and let $S'$ be a proper subset of the equations. Show that the set of solutions in $V$ to $S$ is a proper subspace of the set of solutions in $V$ to $S'$.

Exercise 124 Let $F \subseteq E$ be an extension of fields and $B$ a finite set. Let $V_F$ be the $F$-vector space with basis $B$, ie: the elements of $V_F$ are formal sums

$$\sum_i \lambda_i b_i,$$

with the $\lambda_i \in F$ and the $b_i \in B$. Formal sums are added together and multiplied by $\lambda \in F$ in the obvious way. Similarly let $V_E$ be the $E$-vector space with basis $B$, and identify $V_F$ with a subset (it is not a subspace) of $V_E$ in the obvious way. Now let $S, S'$ be systems of equations in $V_F$ as in the previous exercise. Show that the conclusion reached there is still true when looking at the solution sets in $V_E$.

Exercise 125 Let $F$ be a field and $\alpha_1, \dots, \alpha_{n+1}$ distinct elements of it. Show that

$$\det\begin{pmatrix} \alpha_1^n & \cdots & \alpha_1 & 1 \\ \vdots & & \vdots & \vdots \\ \alpha_{n+1}^n & \cdots & \alpha_{n+1} & 1 \end{pmatrix} \neq 0.$$

(hint: suppose not, and find a polynomial of degree $n$ having $n + 1$ distinct roots in $F$, thus contradicting Theorem 2). This is called the Vandermonde determinant.

Lemma 5 Let $F$ be a field and $f, g \in F[x]$ two polynomials of degree $n$ over $F$. Suppose that there exist $n+1$ distinct values $\alpha_i \in F$, such that $f(\alpha_i) = g(\alpha_i)$ for all $i$. Then $f = g$.

Proof: Let

$$f(x) = \sum_j a_j x^j \quad\text{and}\quad g(x) = \sum_j b_j x^j.$$

We get $n + 1$ expressions of the form

$$y_n\alpha_i^n + \cdots + y_1\alpha_i + y_0 = 0,$$

where $y_j = a_j - b_j$. Think of these last as $n + 1$ equations in the $n + 1$ unknowns $y_j$. The matrix of coefficients is

$$\begin{pmatrix} \alpha_1^n & \cdots & \alpha_1 & 1 \\ \vdots & & \vdots & \vdots \\ \alpha_{n+1}^n & \cdots & \alpha_{n+1} & 1 \end{pmatrix}$$

which is the matrix given in Exercise 125. Its determinant is non-zero, and thus the system of equations has the unique solution $y_j = 0$ for all $j$, so that $a_j = b_j$ for all $j$ and hence $f = g$. □
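The non-vanishing of the determinant in Exercise 125 can be checked exactly on small examples. The following sketch works over the rationals with Python's `Fraction`; the helper names and the sample values are our own. It also compares the determinant with the product of the differences $\alpha_i - \alpha_j$ over $i < j$, a standard closed form that is not derived in these notes and is used here only as a cross-check.

```python
from fractions import Fraction
from itertools import combinations

def vandermonde(values):
    """Rows (a^n, ..., a, 1) for each a in values, as in Exercise 125."""
    n = len(values) - 1
    return [[Fraction(a) ** k for k in range(n, -1, -1)] for a in values]

def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [row[:] for row in matrix]
    size = len(m)
    det = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det  # a row swap changes the sign
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return det

def difference_product(values):
    """Product of (a_i - a_j) over all i < j."""
    result = Fraction(1)
    for x, y in combinations(values, 2):
        result *= Fraction(x) - Fraction(y)
    return result

distinct = [0, 1, 2, 5]
repeated = [0, 1, 1, 5]
```

With distinct values every factor of the product is non-zero, so the determinant cannot vanish; as soon as two values coincide, two rows of the matrix agree and the determinant is zero.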

(13.2) Here is the result we will require in the next section.

Theorem 9 Let $F \subseteq E = F(\alpha)$ be a simple extension of fields with the minimum polynomial of $\alpha$ over $F$ having distinct roots. Let $\{\sigma_1, \sigma_2, \dots, \sigma_k\}$ be distinct non-identity elements of the Galois group $\mathrm{Gal}(E/F)$. Then

$$\sigma_1(x) = \sigma_2(x) = \cdots = \sigma_k(x) = x,$$

is a system of independent linear equations over $E$.

Proof: By Theorem D we have a basis $\{1, \alpha, \alpha^2, \dots, \alpha^d\}$ for $E$ over $F$ where the minimum polynomial $f$ of $\alpha$ has degree $d+1$. Thus any $x \in E$ has the form

$$x = x_0 + x_1\alpha + x_2\alpha^2 + \cdots + x_d\alpha^d,$$

for some $x_i \in F$. By the Extension Theorem, the elements of the Galois group send $\alpha$ to roots of $f$. Suppose these roots are $\{\alpha = \alpha_0, \alpha_1, \dots, \alpha_d\}$ where $\sigma_i(\alpha) = \alpha_i$ (as none of the $\sigma_i$ are the identity, we have that no $\sigma_i$ sends $\alpha$ to itself). Then $x$ satisfies $\sigma_i(x) = x$ if and only if,

$$(\alpha_0 - \alpha_i)x_1 + (\alpha_0^2 - \alpha_i^2)x_2 + \cdots + (\alpha_0^d - \alpha_i^d)x_d = 0.$$

Thus we have a system of equations $Ax = 0$ where the matrix of coefficients $A$ is made up of rows from the larger $d \times d$ matrix $\hat{A}$ given by,

$$\hat{A} = \begin{pmatrix} \alpha_0 - \alpha_1 & \alpha_0^2 - \alpha_1^2 & \cdots & \alpha_0^d - \alpha_1^d \\ \alpha_0 - \alpha_2 & \alpha_0^2 - \alpha_2^2 & \cdots & \alpha_0^d - \alpha_2^d \\ \vdots & \vdots & & \vdots \\ \alpha_0 - \alpha_d & \alpha_0^2 - \alpha_d^2 & \cdots & \alpha_0^d - \alpha_d^d \end{pmatrix}.$$

Suppose we have $\hat{A}b = 0$ for some vector $b = (b_1, \dots, b_d) \in E^d$, so that

$$b_1\alpha_0 + b_2\alpha_0^2 + \cdots + b_d\alpha_0^d = b_1\alpha_i + b_2\alpha_i^2 + \cdots + b_d\alpha_i^d,$$

for each $1 \le i \le d$. Thus if $h = b_1 x + b_2 x^2 + \cdots + b_d x^d$, then we have $h(\alpha_0) = h(\alpha_1) = h(\alpha_2) = \cdots = h(\alpha_d) = c$, say. Thus the degree $d$ polynomial $g = h - c$ agrees with the zero polynomial at $d + 1$ distinct values, hence by the lemma, must be the zero polynomial, and so all the $b_i$ are zero. Thus, the system of equations $\hat{A}x = 0$, hence the system $Ax = 0$, is independent. □
§14. The Fundamental Theorem of Galois Theory

(14.1) In §10. we saw that a complex number $\zeta$ was constructible precisely when there was a tower of fields,

$$\mathbb{Q} = K_0 \subseteq K_1 \subseteq K_2 \subseteq \cdots \subseteq K_n,$$

with each $K_i$ a degree two extension of $K_{i-1}$ and $\mathbb{Q}(\zeta)$ contained in the top field. All the examples we have given so far that use this result have shown that no such tower exists. In other words, they have been negative results, and for good reason: to use the theorem positively, to show that a number can actually be constructed, requires a knowledge of the fields sandwiched in between $\mathbb{Q}$ and $\mathbb{Q}(\zeta)$. In this section we prove the theorem that gives us that knowledge.

(14.2) First we need a picture of all the fields we are interested in, analogous to the picture of all the subgroups of a group that we drew in §11.

Let $F \subseteq E$ be an extension of fields. Call $K$ an intermediate field precisely when $K$ is an extension of $F$ and $E$ is an extension of $K$, ie: $F \subseteq K \subseteq E$. The lattice of intermediate fields is a diagram such that if $K_1$ and $K_2$ are two such fields and $K_1 \subseteq K_2$, then $K_2$ is placed higher in the diagram than $K_1$, with a line connecting them as shown at left. Denote this lattice by $\mathcal{L}(E/F)$.

[Diagram: $K_2$ placed above $K_1$, joined by a line.]

(14.3) From now on we will work in the following situation: we have an extension $F \subseteq E$ such that every irreducible polynomial over $F$ has distinct roots in $E$. For example we saw in Exercise 36 that this is the case if $F$ has characteristic 0. It is also true if $F$ is a finite field, although we omit the proof here.

Thus, the following theorem includes in its remit the fields we have spent most of the time considering: subfields of $\mathbb{C}$ and finite fields. It is only examples like $\mathbb{F}_p(t)$, the rational function field over $\mathbb{F}_p$ (being infinite of characteristic $p > 0$, see Exercise 38) that are left out in the cold.

The Galois Correspondence (part 1). Let $F \subseteq E$ be a finite extension as above with $E$ the splitting field over $F$ of some polynomial $f \in F[x]$, and $G = \mathrm{Gal}(E/F)$ its Galois group. Let $\mathcal{L}(G)$ and $\mathcal{L}(E/F)$ be the subgroup and intermediate field lattices.

1. For any subgroup $H$ of $G$, let

$$E^H = \{\lambda \in E \mid \sigma(\lambda) = \lambda \text{ for all } \sigma \in H\}.$$

Then $E^H$ is an intermediate field, called the fixed field of $H$.

2. For any intermediate field $K$, $\mathrm{Gal}(E/K)$ is a subgroup of $G$.

3. The maps $H \mapsto E^H$ and $K \mapsto \mathrm{Gal}(E/K)$ are mutual inverses, hence bijections $\mathcal{L}(G) \leftrightarrow \mathcal{L}(E/F)$.

4. Both maps reverse order: if $H_1 \subseteq H_2 \subseteq G$ then $F \subseteq E^{H_2} \subseteq E^{H_1} \subseteq E$, and if $F \subseteq K_1 \subseteq K_2 \subseteq E$ then $\mathrm{Gal}(E/K_2) \subseteq \mathrm{Gal}(E/K_1) \subseteq G$.

5. The degree of the extension $E^H \subseteq E$ is equal to the order $|H|$ of the subgroup $H$.

In other words, once you know the lattice of subgroups of the Galois group, you can find the lattice of intermediate fields just by turning it upside down (and vice-versa)! Schematically,

[Diagram: the subgroup lattice, with $\mathrm{Gal}(E/F)$ at the top and $\{\mathrm{id}\}$ at the bottom, beside the intermediate field lattice, with $E$ at the top and $F$ at the bottom.]

There are a few other things worth noticing. The whole Galois group $\mathrm{Gal}(E/F)$ fixes $F$ pointwise, so its fixed field is $F$, while the trivial subgroup, consisting of just the identity automorphism, fixes everything, hence its fixed field is all of $E$. Thus, the largest subgroup corresponds to the smallest intermediate field and the smallest subgroup to the largest intermediate field.

The Theorem also says that the two maps $X \mapsto E^X$ and $Y \mapsto \mathrm{Gal}(E/Y)$ are bijections, hence in particular are 1-1: if $E^{H_1} = E^{H_2}$ then $H_1 = H_2$ and if $\mathrm{Gal}(E/K_1) = \mathrm{Gal}(E/K_2)$ then $K_1 = K_2$.

If the upside down nature of the correspondence seems puzzling, it is simply linear algebra (and indeed this is how we will prove it). If $H$ is a subgroup, think of the fixed field $E^H$ as the set of solutions in $E$ to the equations,

$$\sigma(x) = x, \quad \sigma \in H.$$

The more equations you have, the greater the number of conditions being imposed on $x$, hence the smaller the number of solutions. So larger subgroups should correspond to smaller intermediate fields. That the correspondence is exact, so that as soon as we add just one more equation the number of solutions strictly decreases, will follow from §13. as these equations are linear and independent.

Proof: In the situation described, where $E$ is a finite extension of $F$, the extension must be simple by Theorem 7, ie: of the form $F \subseteq F(\alpha)$ for some $\alpha$ algebraic over $F$.

For the first part, we have $E^H \subseteq E$ by definition, and $F \subseteq E^H$, as every element of $G$, so in particular every element of $H$, fixes $F$. If $\lambda, \mu \in E^H$ then $\sigma(\lambda + \mu) = \sigma(\lambda) + \sigma(\mu) = \lambda + \mu$ and similarly for $\sigma(\lambda\mu)$ and $\sigma(1/\lambda)$. Thus $E^H$ is an intermediate field.

If an automorphism of $E$ fixes the intermediate field $K$ pointwise, then it certainly fixes the field $F$ pointwise. Thus $\mathrm{Gal}(E/K) \subseteq \mathrm{Gal}(E/F)$ and we indeed have a map $\mathcal{L}(E/F) \to \mathcal{L}(G)$ given by $K \mapsto \mathrm{Gal}(E/K)$. If $\lambda$ is fixed by every automorphism in $H_2$, then it is fixed by every automorphism in $H_1$ and so $E^{H_2} \subseteq E^{H_1}$. If $\sigma$ fixes every element of $K_2$ pointwise then it fixes every element of $K_1$ pointwise too, so that $\mathrm{Gal}(E/K_2) \subseteq \mathrm{Gal}(E/K_1)$.

To show that the two maps are inverses of each other, we take a subgroup $H$ and show that their composition,

$$H \mapsto E^H \mapsto \mathrm{Gal}(E/E^H),$$

gets us back to where we started, ie: that $\mathrm{Gal}(E/E^H) = H$. This will then give the desired bijection.

By definition, every element of $H$ fixes $E^H$ pointwise, and since $\mathrm{Gal}(E/E^H)$ consists of all the automorphisms of $E$ that fix $E^H$ pointwise, we have that $H \subseteq \mathrm{Gal}(E/E^H)$. In fact, both of the subgroups $H$ and $\mathrm{Gal}(E/E^H)$ have the same fixed field, ie: $E^{\mathrm{Gal}(E/E^H)} = E^H$. To see this, certainly any $\sigma \in \mathrm{Gal}(E/E^H)$ fixes $E^H$ pointwise by definition, so $E^H \subseteq E^{\mathrm{Gal}(E/E^H)}$. On the other hand, as $H \subseteq \mathrm{Gal}(E/E^H)$ and the maps reverse order, we have $E^{\mathrm{Gal}(E/E^H)} \subseteq E^H$.

By the results of §13., the elements of the fixed field $E^{\mathrm{Gal}(E/E^H)}$ are obtained by solving the system of linear equations $\sigma(x) = x$ for all $\sigma \in \mathrm{Gal}(E/E^H)$, and these equations are independent. In particular, a proper subset of these equations has a strictly larger solution set. We already have that $H \subseteq \mathrm{Gal}(E/E^H)$, so suppose that $H$ is a proper subgroup of $\mathrm{Gal}(E/E^H)$. The fixed field $E^H$ would then properly contain the fixed field $E^{\mathrm{Gal}(E/E^H)}$. As this contradicts $E^H = E^{\mathrm{Gal}(E/E^H)}$, we must have that $H = \mathrm{Gal}(E/E^H)$.

Thus the map $H \mapsto E^H$ is a bijection as desired.

As $E$ is a splitting field we can apply Theorem G to get $|\mathrm{Gal}(E/E^H)| = [E : E^H]$, where we now have that $\mathrm{Gal}(E/E^H) = H$, so that $|H| = [E : E^H]$. □

(14.4) We are certainly long overdue an example. In §12. we reverified the example of the first lecture to show that the splitting field $\mathbb{Q}(\alpha, \omega)$ of the polynomial $x^3 - 2$ had Galois group,

$$G = \mathrm{Gal}(\mathbb{Q}(\alpha, \omega)/\mathbb{Q}) = \{\mathrm{id}, \sigma, \sigma^2, \tau, \sigma\tau, \sigma^2\tau\},$$

where $\sigma(\alpha) = \alpha\omega$, $\sigma(\omega) = \omega$ and $\tau(\alpha) = \alpha$, $\tau(\omega) = \omega^2$. We claim that the subgroup lattice $\mathcal{L}(G)$ is,

[Diagram: $G$ at the top; beneath it the subgroups $\{\mathrm{id}, \sigma, \sigma^2\}$, $\{\mathrm{id}, \tau\}$, $\{\mathrm{id}, \sigma\tau\}$ and $\{\mathrm{id}, \sigma^2\tau\}$; $\{\mathrm{id}\}$ at the bottom.]

Firstly, the subsets given are easily seen to be subgroups, so we just need to check that the picture is complete. Let $H$ be an arbitrary subgroup of $G$ and suppose that $H$ contains the element $\sigma$. Then it must contain all the powers of $\sigma$, hence must contain the subgroup $\{\mathrm{id}, \sigma, \sigma^2\}$. Thus the order of $H$ is constrained by $3 \le |H| \le 6$, and by Lagrange's Theorem $|H|$ divides 6, so we must have $|H| = 3$ or 6. Thus $H$ must equal $\{\mathrm{id}, \sigma, \sigma^2\}$ or be all of $G$. This completely describes all the subgroups that contain the element $\sigma$. The same argument (and conclusion) applies to the subgroups containing $\sigma^2$. Thus we are left to describe the subgroups containing any one of the three "reflections" $\tau, \sigma\tau, \sigma^2\tau$ but not $\sigma$ or $\sigma^2$. Let $H$ be a subgroup containing $\tau$. As $H$ contains $\{\mathrm{id}, \tau\}$, and by Lagrange, it has order 2, 3 or 6. The only one of these three possibilities not already in the lattice is the order 3 case, so we show that this is not possible. To have order 3, $H$ must also contain one of $\sigma\tau$ or $\sigma^2\tau$. If the former, then it also contains $\sigma\tau\tau = \sigma$, a contradiction, and similarly for the other case. Thus the lattice $\mathcal{L}(G)$ is as depicted$^{13}$.
The Galois Correspondence now gives the lattice $\mathcal{L}(E/F)$ of intermediate fields to be,

[Diagram: $\mathbb{Q}(\alpha, \omega)$ at the top; beneath it four intermediate fields $F_i$; $\mathbb{Q}$ at the bottom.]

with $F_2$ the fixed field of the subgroup $\{\mathrm{id}, \sigma, \sigma^2\}$ and the others the fixed fields (in no particular order) of the three order two subgroups. By the fifth part of the Galois correspondence, each of the extensions $F_i \subseteq \mathbb{Q}(\alpha, \omega)$ has degree the order of the appropriate subgroup, so that $\mathbb{Q}(\alpha, \omega)$ is a degree three extension of $F_2$, and a degree two extension of the other intermediate fields.
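The subgroup lattice in (14.4) is small enough to find by brute force. In this sketch, which uses our own encoding rather than anything from the notes, a pair `(a, b)` stands for the automorphism with $\alpha \mapsto \alpha\omega^a$ and $\omega \mapsto \omega^b$, so that $\sigma$ is `(1, 1)` and $\tau$ is `(0, 2)`.

```python
from itertools import combinations, product

def compose(s, t):
    """Apply t first, then s; (a, b) means alpha -> alpha*omega^a, omega -> omega^b."""
    a, b = s
    c, d = t
    return ((a + b * c) % 3, (b * d) % 3)

IDENTITY = (0, 1)
G = list(product(range(3), (1, 2)))

def is_subgroup(subset):
    # In a finite group, a non-empty subset closed under the operation is a subgroup.
    return all(compose(s, t) in subset for s in subset for t in subset)

others = [g for g in G if g != IDENTITY]
subgroups = []
for size in range(len(others) + 1):
    for extra in combinations(others, size):
        candidate = frozenset((IDENTITY,) + extra)
        if is_subgroup(candidate):
            subgroups.append(candidate)

# [E : E^H] = |H|, so by the tower law [E^H : Q] = |G| / |H|.
degrees_over_Q = sorted(len(G) // len(H) for H in subgroups)
```

The search finds six subgroups, of orders 1, 2, 2, 2, 3 and 6, matching the lattice claimed above, and the corresponding intermediate fields have degrees 6, 3, 3, 3, 2 and 1 over $\mathbb{Q}$.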

Suppose that $F_1$ is the fixed field of the subgroup $\{\mathrm{id}, \tau\}$. We find an explicit description of its elements. From the Tower law, a basis for $\mathbb{Q}(\alpha, \omega)$ over $\mathbb{Q}$ is given by

$$\{1, \alpha, \alpha^2, \omega, \alpha\omega, \alpha^2\omega\},$$

so that an arbitrary element $x$ of $\mathbb{Q}(\alpha, \omega)$ has the form,

$$x = a_0 + a_1\alpha + a_2\alpha^2 + a_3\omega + a_4\alpha\omega + a_5\alpha^2\omega,$$

with the $a_i \in \mathbb{Q}$. The element $x$ is in $F_1$ if and only if $\tau(x) = x$ where,

$$\tau(x) = a_0 + a_1\alpha + a_2\alpha^2 + a_3\omega^2 + a_4\alpha\omega^2 + a_5\alpha^2\omega^2$$
$$= a_0 + a_1\alpha + a_2\alpha^2 + a_3(-1 - \omega) + a_4\alpha(-1 - \omega) + a_5\alpha^2(-1 - \omega)$$
$$= (a_0 - a_3) + (a_1 - a_4)\alpha + (a_2 - a_5)\alpha^2 - a_3\omega - a_4\alpha\omega - a_5\alpha^2\omega.$$

Because we are using a basis, we can equate coefficients to get,

$$a_0 - a_3 = a_0,\; a_1 - a_4 = a_1,\; a_2 - a_5 = a_2,\; -a_3 = a_3,\; -a_4 = a_4 \text{ and } -a_5 = a_5.$$

Thus, $a_3 = a_4 = a_5 = 0$ and $a_0, a_1, a_2$ are completely arbitrary. Hence $x$ has the form $x = a_0 + a_1\alpha + a_2\alpha^2$ so is an element of $\mathbb{Q}(\alpha)$. This gives that $F_1 \subseteq \mathbb{Q}(\alpha)$. On the other hand, $\tau$ fixes $\mathbb{Q}$ pointwise and fixes $\alpha$ by definition, hence fixes every element of $\mathbb{Q}(\alpha)$. This gives that $\mathbb{Q}(\alpha) \subseteq F_1$ and so $F_1 = \mathbb{Q}(\alpha)$.

$^{13}$ In general such arguments become more complicated as the order of the Galois group increases.
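The coefficient comparison above is a linear algebra computation over $\mathbb{Q}$, and can be repeated mechanically. In this sketch an element of $\mathbb{Q}(\alpha, \omega)$ is stored as its six coordinates in the basis $\{1, \alpha, \alpha^2, \omega, \alpha\omega, \alpha^2\omega\}$; the functions `tau` and `rank` are our own helpers, written only for this illustration.

```python
from fractions import Fraction

def tau(coeffs):
    """Apply tau (alpha -> alpha, omega -> omega^2) to coordinates in the basis
    {1, alpha, alpha^2, omega, alpha*omega, alpha^2*omega}, using omega^2 = -1 - omega."""
    a0, a1, a2, a3, a4, a5 = coeffs
    return (a0 - a3, a1 - a4, a2 - a5, -a3, -a4, -a5)

def rank(rows):
    """Rank of a rational matrix by Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(len(m)):
            if i != r and m[i][col] != 0:
                factor = m[i][col] / m[r][col]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
    return r

basis = [tuple(int(i == j) for j in range(6)) for i in range(6)]
# Rows are tau(e_i) - e_i; the fixed field is the kernel of tau - id.
tau_minus_id = [[t - e for t, e in zip(tau(v), v)] for v in basis]
fixed_dimension = 6 - rank(tau_minus_id)
```

The kernel of $\tau - \mathrm{id}$ has dimension 3, which agrees with $[\mathbb{Q}(\alpha) : \mathbb{Q}] = 3$ and with the degree $6/2$ predicted by the correspondence for a subgroup of order two.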

(14.5) The Galois correspondence allows us to "model" the subgroups of the Galois group by the intermediate fields (and vice-versa). But there are subgroups and there are subgroups: what about the normal subgroups? As they are slightly special, they should correspond to slightly special intermediate fields. Is the Galois correspondence sensitive enough to spot the difference?

Let $F \subseteq E$ be an extension of fields with Galois group $\mathrm{Gal}(E/F)$, and let $K$ be an intermediate field and $\sigma \in \mathrm{Gal}(E/F)$. The image of $K$ by the automorphism $\sigma$ is another intermediate field, and so we get the picture below left. By the Galois correspondence, there are subgroups $\mathrm{Gal}(E/K)$ and $\mathrm{Gal}(E/\sigma(K))$ corresponding to the two intermediate fields as shown below right:

[Diagram: left, the fields $K$ and $\sigma(K)$ between $F$ and $E$; right, the subgroups $\mathrm{Gal}(E/K)$ and $\mathrm{Gal}(E/\sigma(K))$ between $\{\mathrm{id}\}$ and $\mathrm{Gal}(E/F)$, related by the Galois correspondence.]

The two intermediate fields are then related by,

Proposition 5 The subgroups $\mathrm{Gal}(E/K)$ and $\mathrm{Gal}(E/\sigma(K))$ are conjugate, indeed,

$$\mathrm{Gal}(E/\sigma(K)) = \sigma^{-1}\,\mathrm{Gal}(E/K)\,\sigma.$$

(We are reading expressions in a group from left to right).

Proof: If $x \in \sigma(K)$, then $x = \sigma(y)$ for some $y \in K$. Thus if $\tau \in \mathrm{Gal}(E/K)$, then $\sigma^{-1}\tau\sigma$ (read from left to right) fixes $x$, and so is contained in $\mathrm{Gal}(E/\sigma(K))$. Thus $\sigma^{-1}\,\mathrm{Gal}(E/K)\,\sigma \subseteq \mathrm{Gal}(E/\sigma(K))$. The proof of the opposite inclusion is the same. □

(14.6) Remember that a subgroup $N$ of $G$ is normal when $g^{-1}Ng = N$ for all $g \in G$ (see §11.). We have $\sigma^{-1}\,\mathrm{Gal}(E/K)\,\sigma = \mathrm{Gal}(E/\sigma(K))$, and this in turn will clearly equal $\mathrm{Gal}(E/K)$ when $\sigma(K) = K$ for all $\sigma$. So this is the kind of intermediate field that picks out normal subgroups: one that is sent to itself by any automorphism$^{14}$.

If every automorphism sends $K$ to itself then any automorphism of $E$ restricts to an automorphism of $K$ as well. This is all summarised in the second part of the Galois correspondence:

The Galois Correspondence (part 2). Under the assumptions of the first part of the Galois correspondence, let $K$ be an intermediate field. Then, $\sigma(K) = K$ for all $\sigma \in \mathrm{Gal}(E/F)$ if and only if $\mathrm{Gal}(E/K)$ is a normal subgroup of $\mathrm{Gal}(E/F)$, and in this case,

$$\mathrm{Gal}(E/F)/\mathrm{Gal}(E/K) \cong \mathrm{Gal}(K/F).$$

$^{14}$ Note that this is different from saying that the field is fixed pointwise, which is a far stronger property.
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Proof: if a(K) = K for all a then by Proposition 5, a -1 Gal {E/K)a = Gal (E/a(K)) =Gal{E/K) 
for all <t and so Gal (i? / K) is normal. On the otherhand, if Gal (E / K) is normal then Proposition 5 gives 
that Gal (E/a(K)) = Gal (E/K) for all a, where X ^ Gal (E/X) is a 1-1 map by the first part of the 
Galois correspondence, hence we have a(K) — K for all a. 

Now define a map $\mathrm{Gal}(E/F) \to \mathrm{Gal}(K/F)$ by taking an automorphism $\sigma$ of $E$ fixing $F$ pointwise
and restricting it to $K \subseteq E$. We get an automorphism of $K$ as $\sigma(K) = K$ for any $\sigma$. The map is a
homomorphism between these two groups rather trivially, as the same operation, namely composition of
automorphisms, is being used in both. An element $\sigma$ is in the kernel of the homomorphism if and only
if it restricts to the identity map on $K$ (i.e. fixes $K$ pointwise), which happens if and only
if $\sigma$ is in $\mathrm{Gal}(E/K)$. If $\tau$ is an automorphism of $K$ fixing $F$ pointwise then by Theorem ??, it can be
extended to an automorphism of $E$ fixing $F$ pointwise, i.e. any element of the Galois group $\mathrm{Gal}(K/F)$
can be obtained by restricting an element of $\mathrm{Gal}(E/F)$. Thus the map is onto and the isomorphism
follows by the first isomorphism theorem. □

We used Theorem ?? in the proof to show that any element of $\mathrm{Gal}(K/F)$ is the restriction of an
element of $\mathrm{Gal}(E/F)$. Moreover, Theorem ?? says that an element of $\mathrm{Gal}(K/F)$ will be the restriction
of $[E : K]$ elements of $\mathrm{Gal}(E/F)$, and this agrees with the isomorphism given above: the identity
of $\mathrm{Gal}(K/F)$ is the restriction of $[E : K] = |\mathrm{Gal}(E/K)|$ elements of $\mathrm{Gal}(E/F)$; in other words,
the kernel of the mapping given in the proof has $|\mathrm{Gal}(E/K)|$ elements.

Exercise 126 A subgroup $H$ of a group $G$ is said to be malnormal when $g \in G \setminus H$ gives that $g^{-1}Hg \cap H = \{\mathrm{id}\}$. Thus, the
malnormal subgroups are in some sense the antithesis of the normal ones. Show that the malnormal subgroups can be spotted by
the Galois correspondence by describing the intermediate fields they correspond to.

(14.7) Here is a simple application. According to Exercise 110, any subgroup of index two in a group $G$
is a normal subgroup. By the first part of the Galois correspondence, subgroups of index two correspond
to intermediate fields $F \subseteq K \subseteq E$ with the degree of the extension $F \subseteq K$ equal to two. By the second
part of the Galois correspondence, any automorphism of $E$ fixing $F$ pointwise must send such a $K$ to
itself.

Further Exercises for §14. 
Exercise 127 Complete the example above: 




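Exercise 128

1. Let $\alpha = \sqrt[4]{2} \in \mathbb{R}$ and $i \in \mathbb{C}$, and consider the field $\mathbb{Q}(\alpha, i) \subseteq \mathbb{C}$. Suppose that $\sigma, \tau$ are automorphisms of $\mathbb{Q}(\alpha, i)$ such
that

$$\sigma(i) = i,\quad \sigma(\alpha) = \alpha i,\quad \tau(i) = -i,\quad \text{and}\quad \tau(\alpha) = \alpha.$$

Show that the elements of

$$G = \{1, \sigma, \sigma^2, \sigma^3, \tau, \sigma\tau, \sigma^2\tau, \sigma^3\tau\}$$

are then distinct automorphisms of $\mathbb{Q}(\alpha, i)$, and that $\tau\sigma = \sigma^3\tau$.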

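2. Suppose now that the $G$ above is the Galois group of $\mathbb{Q}(\alpha, i)$ over $\mathbb{Q}$, and that $G$ has the lattice of subgroups as shown on
the left:

[Diagram: on the left, the subgroup lattice of $G$, with the subgroups of order four $\{1, \sigma^2, \tau, \sigma^2\tau\}$, $\{1, \sigma, \sigma^2, \sigma^3\}$ and $H_1$, and the subgroups of order two $H_2$, $\{1, \tau\}$, $\{1, \sigma^2\}$, $\{1, \sigma^3\tau\}$ and $H_3$; on the right, the corresponding lattice of subfields, including $F_1$, $F_2$ and $\mathbb{Q}((1+i)\alpha)$.]

Find the subgroups $H_1$, $H_2$ and $H_3$ of $G$. If the corresponding lattice of subfields is as shown on the right, then express the
fields $F_1$ and $F_2$ in the form $\mathbb{Q}(\beta_1, \ldots, \beta_n)$ for some $\beta_1, \ldots, \beta_n \in \mathbb{C}$.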

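Exercise 129

1. Let $\omega = \cos\frac{2\pi}{7} + i\sin\frac{2\pi}{7}$. Show that $\mathbb{Q}(\omega)$ is the splitting field of the polynomial

$$1 + x + x^2 + x^3 + x^4 + x^5 + x^6.$$

Deduce that $|\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})| = 6$.

2. Suppose $\sigma \in \mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})$ is such that $\sigma(\omega) = \omega^3$. Show that,

$$\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q}) = \{1, \sigma, \sigma^2, \sigma^3, \sigma^4, \sigma^5\}.$$

3. Using the Galois correspondence, show that the lattice of intermediate fields is:

[Diagram: $\mathbb{Q}(\omega)$ at the top, the two intermediate fields $F_1$ and $F_2$ below it, and $\mathbb{Q}$ at the bottom.]

where $F_1$ is a degree 2 extension of $\mathbb{Q}$ and $F_2$ a degree 3 extension. Find complex numbers $\beta_1, \ldots, \beta_n$ such that $F_2 = \mathbb{Q}(\beta_1, \ldots, \beta_n)$.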

Exercise 130 Let $\alpha = \sqrt[6]{2}$ and $\omega = \frac{1}{2} + \frac{\sqrt{3}}{2}i$, and consider the field extension $\mathbb{Q} \subseteq \mathbb{Q}(\alpha, \omega)$.

1. Show that $|\mathrm{Gal}(\mathbb{Q}(\alpha, \omega)/\mathbb{Q})| = 12$.

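2. Find a basis for $\mathbb{Q}(\alpha, \omega)$ over $\mathbb{Q}$.

3. Suppose that $\sigma, \tau \in \mathrm{Gal}(\mathbb{Q}(\alpha, \omega)/\mathbb{Q})$ are such that $\tau : \alpha \mapsto \alpha,\ \omega \mapsto \omega^5$ and $\sigma : \alpha \mapsto \alpha\omega,\ \omega \mapsto \omega$. Show that the elements of

$$G = \{1, \sigma, \sigma^2, \sigma^3, \sigma^4, \sigma^5, \tau, \tau\sigma, \tau\sigma^2, \tau\sigma^3, \tau\sigma^4, \tau\sigma^5\}$$

are then distinct elements of $\mathrm{Gal}(\mathbb{Q}(\alpha, \omega)/\mathbb{Q})$ too (do this by observing their effect on the basis).

4. Part of the subgroup lattice $\mathcal{L}_G$ is shown below. Find the corresponding part of the lattice of intermediate fields.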




{id,aV 4 } 
Exercise 131 Let $\omega = \cos\frac{2\pi}{5} + i\sin\frac{2\pi}{5}$ and consider $\mathbb{Q}(\omega)$.

1. Show that $\mathbb{Q}(\omega)$ is the splitting field of the polynomial $1 + x + x^2 + x^3 + x^4$.

2. Deduce that $|\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})| = 4$.

3. Suppose cr £ Gal (Q(tt>)/Q) is such that <r(ci>) = a; . by combining parts (a) and (b), show that, 

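$$\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q}) = \{1, \sigma, \sigma^2, \sigma^3\}$$

(hence the Galois group is cyclic).

4. Find the subgroup lattice $\mathcal{L}_G$ for $G = \mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})$.

5. Using the Galois correspondence, deduce that the lattice of intermediate fields is

[Diagram: $\mathbb{Q}(\omega)$ at the top, a single intermediate field $F$, and $\mathbb{Q}$ at the bottom.]

Find a complex number $\beta$ such that $F = \mathbb{Q}(\beta)$.
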

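Exercise 132 Consider the polynomial $f(x) = (x^2 - 2)(x^2 - 5) \in \mathbb{Q}[x]$.

1. Show that $\mathbb{Q}(\sqrt{2}, \sqrt{5})$ is the splitting field of $f$ over $\mathbb{Q}$.

2. Show that the Galois group $\mathrm{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{5})/\mathbb{Q})$ has order four.

You may assume that if $a, b, c \in \mathbb{Q}$ satisfy $a\sqrt{2} + b\sqrt{5} + c = 0$ then $a = b = c = 0$.

3. Assume that $\sigma$ and $\tau$ are automorphisms of $\mathbb{Q}(\sqrt{2}, \sqrt{5})$ defined by,

$$\sigma : \sqrt{2} \mapsto -\sqrt{2},\ \sqrt{5} \mapsto \sqrt{5} \qquad\text{and}\qquad \tau : \sqrt{2} \mapsto \sqrt{2},\ \sqrt{5} \mapsto -\sqrt{5}.$$

List the elements of the Galois group $\mathrm{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{5})/\mathbb{Q})$, justifying your answer.

4. Complete the subgroup lattice on the left by listing the elements of $H$,

[Diagram: on the left, the subgroup lattice with $\mathrm{Gal}(\mathbb{Q}(\sqrt{2}, \sqrt{5})/\mathbb{Q})$ at the top, the subgroups $\{\mathrm{id}, \sigma\}$, $H$ and $\{\mathrm{id}, \tau\}$ in the middle, and $\{\mathrm{id}\}$ at the bottom; on the right, the lattice of fields with $\mathbb{Q}(\sqrt{2}, \sqrt{5})$ at the top, the intermediate fields $\mathbb{Q}(\sqrt{2})$, $F$ and $\mathbb{Q}(\sqrt{5})$, and $\mathbb{Q}$ at the bottom.]

and use your answer to write the field $F$ in the form $\mathbb{Q}(\theta)$ for some $\theta \in \mathbb{C}$.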
§15. Applications of the Galois Correspondence

Constructing polygons 

If $p$ is a prime number, then a regular $p$-gon can be constructed only if $p$ is a Fermat prime of the form

$$2^{2^t} + 1.$$

We proved this back in §10, and all it required was the idea of the degree of an extension. In other words,
we really didn't require any Galois Theory in the proof, if you take Galois Theory to mean the interplay
between fields and their Galois groups.

What about results in the positive direction? Can $p$-gons with a Fermat prime number of sides be constructed?
The first few such primes are 3, 5 and 17, and we saw in §7 that these three were constructible,
albeit only if we believe Gauss's identity for $\cos(\pi/17)$. Thus, the explicit construction of these polygons is a
complicated business. Nevertheless, the full power of Galois Theory proper gives,

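Theorem 10 If $p$ is a Fermat prime then a regular $p$-gon can be constructed.

Proof: By Theorem E we are done if we can find a tower of fields,

$$\mathbb{Q} = K_0 \subseteq K_1 \subseteq \cdots \subseteq K_n = \mathbb{Q}(\zeta),$$

for $\zeta = \cos(2\pi/p) + i\sin(2\pi/p)$ and with $[K_i : K_{i-1}] = 2$. As $\mathbb{Q}(\zeta)$ is the splitting field of the $p$-th
cyclotomic polynomial

$$\Phi_p(x) = x^{p-1} + x^{p-2} + \cdots + x + 1,$$

we have by Theorem G that the Galois group has order,

$$|\mathrm{Gal}(\mathbb{Q}(\zeta)/\mathbb{Q})| = [\mathbb{Q}(\zeta) : \mathbb{Q}] = \deg \Phi_p = p - 1 = 2^n,$$

(as $p$, being a Fermat prime, is of the form $p = 2^n + 1$). In §12 we showed that $\mathrm{Gal}(\mathbb{Q}(\zeta)/\mathbb{Q})$ is a
cyclic group, and so by Exercise 100, we can find a chain of subgroups[15]

$$\{\mathrm{id}\} = H_0 \subseteq H_1 \subseteq \cdots \subseteq H_n = \mathrm{Gal}(\mathbb{Q}(\zeta)/\mathbb{Q}),$$

where $H_i$ has order $2^i$. Making it explicit, if $\mathrm{Gal}(\mathbb{Q}(\zeta)/\mathbb{Q}) = \{\mathrm{id}, \sigma, \sigma^2, \ldots, \sigma^{2^n - 1}\}$ then the subgroups
are,

$$\{\mathrm{id}\} \subseteq \{\mathrm{id}, \sigma^{2^{n-1}}\} \subseteq \{\mathrm{id}, \sigma^{2^{n-2}}, \sigma^{2 \cdot 2^{n-2}}, \sigma^{3 \cdot 2^{n-2}}\} \subseteq \cdots$$

$$\cdots \subseteq \{\mathrm{id}, \sigma^{2^{n-i}}, \sigma^{2 \cdot 2^{n-i}}, \sigma^{3 \cdot 2^{n-i}}, \ldots\} \subseteq \cdots \subseteq \{\mathrm{id}, \sigma^2, \sigma^4, \ldots\}.$$

The Galois correspondence thus gives a chain of fields,

$$\mathbb{Q} = K_0 \subseteq K_1 \subseteq \cdots \subseteq K_n = \mathbb{Q}(\zeta),$$

where $K_{n-i}$ is the fixed field $E^{H_i}$ of the subgroup $H_i$ (with $E = \mathbb{Q}(\zeta)$). Letting $j = n - i$, we have the extension
$K_j \subseteq \mathbb{Q}(\zeta)$ of degree the order $2^i$ of $H_i$. In particular, by the tower law,

$$[\mathbb{Q}(\zeta) : K_{j-1}] = [\mathbb{Q}(\zeta) : K_j][K_j : K_{j-1}],$$

where $j - 1 = n - (i + 1)$, so that $[\mathbb{Q}(\zeta) : K_{j-1}] = 2^{i+1}$. Thus $2^{i+1} = 2^i [K_j : K_{j-1}]$, so that
$[K_j : K_{j-1}] = 2$ as required. □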
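
The chain of subgroups in the proof can be checked by hand for a small Fermat prime. The following is a minimal sketch, assuming the standard identification of $\mathrm{Gal}(\mathbb{Q}(\zeta)/\mathbb{Q})$ with the multiplicative group of non-zero integers modulo $p$ (the automorphism $\zeta \mapsto \zeta^k$ corresponding to $k$); the function name `subgroup_chain` is purely illustrative.

```python
def subgroup_chain(p):
    """Return the subgroups H_0, H_1, ..., H_n of (Z/p)^*, with |H_i| = 2^i.

    Assumes p is a Fermat prime, so that p - 1 = 2^n.
    """
    order = p - 1
    n = order.bit_length() - 1
    assert order == 2 ** n, "p - 1 must be a power of two"

    def generated(g):
        # The cyclic subgroup of (Z/p)^* generated by g.
        elems, x = {1}, g % p
        while x != 1:
            elems.add(x)
            x = (x * g) % p
        return elems

    # A generator sigma of the cyclic group (Z/p)^*.
    sigma = next(g for g in range(2, p) if len(generated(g)) == order)
    # H_i is generated by sigma^(2^(n-i)), exactly as in the proof.
    return [generated(pow(sigma, 2 ** (n - i), p)) for i in range(n + 1)]


chain = subgroup_chain(17)
for H in chain:
    print(len(H), sorted(H))
```

Each subgroup has index two in the next, which is what makes every step $K_{j-1} \subseteq K_j$ of the tower a degree two extension.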

Corollary. If $n = 2^k p_1 p_2 \cdots p_m$ with the $p_i$ distinct Fermat primes, then a regular $n$-gon can be constructed.

Proof: Certainly $2^k$-gons can be constructed just by repeatedly bisecting angles. Thus, an $n$-gon can be
constructed, where $n$ has the form given, by Exercise ??. □

Remarkably, with a little more Galois Theory, the converse to this statement can also be proved, thus
completely determining those $n$-gons that can be constructed.

[15] Alternatively, these subgroups can be found using Sylow's Theorem from §11.
(15.1) The angle $\pi/n$ can be constructed precisely when the angle $2\pi/n$ can be constructed, which in
turn happens precisely when the regular $n$-gon can be constructed. Thus, the list of submultiples of $\pi$
that are constructible runs as,

$$\frac{\pi}{2},\ \frac{\pi}{3},\ \frac{\pi}{4},\ \frac{\pi}{5},\ \frac{\pi}{6},\ \frac{\pi}{8},\ \frac{\pi}{10},\ \frac{\pi}{12},\ \frac{\pi}{15},\ \ldots$$
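
The list above can be regenerated from the Corollary and its converse. This is a small sketch under the assumption that the only Fermat primes are the five currently known ones (3, 5, 17, 257 and 65537), which is harmless for small $n$; the name `is_constructible` is illustrative only.

```python
FERMAT_PRIMES = (3, 5, 17, 257, 65537)  # the Fermat primes known at present


def is_constructible(n):
    """True when n = 2^k * (a product of distinct Fermat primes)."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    while n % 2 == 0:
        n //= 2
    for p in FERMAT_PRIMES:
        if n % p == 0:
            n //= p  # divide once only: the Fermat primes must be distinct
    return n == 1


print([n for n in range(2, 16) if is_constructible(n)])
```

Dividing by each Fermat prime at most once is what rules out, for example, $n = 9 = 3 \cdot 3$.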

Exercise 133 Give direct proofs of the non-constructibility of the angles,

$$\frac{\pi}{7},\ \frac{\pi}{9},\ \frac{\pi}{11}\ \text{and}\ \frac{\pi}{13}.$$

The Fundamental Theorem of Algebra 

The so-called fundamental theorem of algebra can be proved from the Galois correspondence, and we 
have already observed how curious it is that a theorem fundamental to all of algebra can be deduced from 
a theorem fundamental to just part of it. 

The proof requires two straightforward observations. First, there are no extensions of the reals of odd
degree $> 1$. This is because any polynomial in $\mathbb{R}[x]$ has roots that are either real or occur in complex
conjugate pairs, hence in particular, a real polynomial with odd degree $> 1$ has a real root and so is
reducible over $\mathbb{R}$. Thus, the minimum polynomial over $\mathbb{R}$ of any $\alpha \notin \mathbb{R}$ must have even degree. If $\mathbb{R} \subseteq L$
is an extension, then choosing $\alpha \in L \setminus \mathbb{R}$, we have

$$[L : \mathbb{R}] = [L : \mathbb{R}(\alpha)][\mathbb{R}(\alpha) : \mathbb{R}],$$

with the last term even by the comments above, hence $[L : \mathbb{R}]$ is even.

The other observation is that the complexes have no extensions of degree two. If $\mathbb{C} \subseteq L$ with
$[L : \mathbb{C}] = 2$ then choose $\alpha \in L \setminus \mathbb{C}$ so that we have the intermediate field $\mathbb{C} \subseteq \mathbb{C}(\alpha) \subseteq L$. We must certainly have
$[\mathbb{C}(\alpha) : \mathbb{C}] = 1$ or 2, and if the degree was 1 then we would have $\alpha \in \mathbb{C}$, so $[\mathbb{C}(\alpha) : \mathbb{C}] = 2$, and thus
$L = \mathbb{C}(\alpha)$. If $f$ is the minimum polynomial of $\alpha$ over $\mathbb{C}$ then $f = x^2 + bx + c$ for some $b, c \in \mathbb{C}$ and $\alpha$
will be one of the two roots

$$\frac{-b \pm \sqrt{b^2 - 4c}}{2},$$

which are in $\mathbb{C}$, contradicting the choice of $\alpha$.
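
The second observation rests on the fact that the quadratic formula never leaves $\mathbb{C}$, because every complex number has a complex square root. A quick numerical illustration, using Python's standard `cmath` module and an arbitrarily chosen pair of coefficients $b, c$ (they are not taken from the text):

```python
import cmath


def quadratic_roots(b, c):
    """The two roots of x^2 + b x + c for complex coefficients b and c."""
    d = cmath.sqrt(b * b - 4 * c)  # a complex square root always exists
    return (-b + d) / 2, (-b - d) / 2


b, c = 1 + 2j, 3 - 1j  # an arbitrary example
roots = quadratic_roots(b, c)
for r in roots:
    # The residual should vanish up to rounding error.
    print(r, abs(r * r + b * r + c))
```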

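Fundamental Theorem of Algebra. Any non-constant $f \in \mathbb{C}[x]$ has a root in $\mathbb{C}$.

Proof: The proof toggles back and forth between intermediate fields and subgroups of Galois groups
using the Galois correspondence. Replacing $f$ by $f\bar{f}$, where $\bar{f}$ is obtained by conjugating the coefficients of $f$,
we may assume that $f$ has real coefficients: $f\bar{f} \in \mathbb{R}[x]$, and if $z$ is a root of $f\bar{f}$ then $z$ or $\bar{z}$ is a root of $f$.
If the polynomial $f$ is reducible over $\mathbb{R}$, with $f = pq$, then replace $f$
by $p$ and continue. Thus we may assume that $f$ is irreducible over $\mathbb{R}$, and let $E$ be the splitting field over
$\mathbb{R}$ not of $f$, but of $(x^2 + 1)f$. Thus in particular we have that $\mathbb{R}$ and $\pm i$ are in $E$, hence $\mathbb{C}$ is too, and thus

$$\mathbb{R} \subseteq \mathbb{C} \subseteq E.$$

The conditions of the first part of the Galois correspondence hold for $E$, so we may apply this to the
Galois group $G = \mathrm{Gal}(E/\mathbb{R})$. Since $G$ is a finite group, we may factor from its order all the powers of
2, writing $|G| = 2^k m$, where $m \geq 1$ is odd. In particular, Sylow's Theorem gives us a subgroup $H$ of $G$
of order $2^k$, and so by the Galois correspondence we have the picture:

[Diagram: the chain of subgroups $\{\mathrm{id}\} \subseteq H \subseteq G$, with $H$ of order $2^k$, and the corresponding chain of fields $E \supseteq F = E^H \supseteq \mathbb{R}$.]

with the intermediate field $F$ corresponding to $H$ giving an extension $F \subseteq E$ of degree $2^k$. As $[E : \mathbb{R}] =
[E : F][F : \mathbb{R}]$ with $[E : \mathbb{R}] = |G| = 2^k m$, we must have that $F$ is a degree $m$ extension of $\mathbb{R}$. As $m$ is
odd and no such extensions exist if $m > 1$, we must have $m = 1$ and so $|G| = 2^k$.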
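We now use the Galois correspondence in the reverse direction:

[Diagram: the chain of fields $\mathbb{R} \subseteq \mathbb{C} \subseteq E$ and the corresponding chain of subgroups $G \supseteq \mathrm{Gal}(E/\mathbb{C}) \supseteq \{\mathrm{id}\}$.]

As $G$ has order $2^k$, the subgroup $\mathrm{Gal}(E/\mathbb{C})$ has order dividing this, hence order $2^s$ for some $s \geq 0$. If $s \geq 1$ then Sylow's
Theorem again gives us a subgroup $H$ of $\mathrm{Gal}(E/\mathbb{C})$ of order $2^{s-1}$, and we have the picture:

[Diagram: the chain of subgroups $\{\mathrm{id}\} \subseteq H \subseteq \mathrm{Gal}(E/\mathbb{C}) \subseteq G$, with $H$ of index 2 in $\mathrm{Gal}(E/\mathbb{C})$, and the corresponding chain of fields $E \supseteq E^H \supseteq \mathbb{C}$.]

with $2^{s-1}[E^H : \mathbb{C}] = [E : \mathbb{C}] = |\mathrm{Gal}(E/\mathbb{C})| = 2^s$, hence $E^H$ is a degree 2 extension of $\mathbb{C}$. We
commented above that there are no such extensions, thus we must have $s = 0$, and so $|\mathrm{Gal}(E/\mathbb{C})| = 2^0 = 1$,
giving that $\mathrm{Gal}(E/\mathbb{C})$ is the trivial group. We have then two fields, namely $E$ and $\mathbb{C}$, that map via the
1-1 map $X \mapsto \mathrm{Gal}(E/X)$ to the trivial group, so $E = \mathbb{C}$. As $E$ was the splitting field of the polynomial
$(x^2 + 1)f$, we get that $f$ has a root, indeed all its roots, in $\mathbb{C}$. □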
§16. (Not) Solving Equations



At the beginning of these notes we said that Galois Theory was initially motivated by the desire to
understand better the roots of polynomial equations; in particular, to provide a context for the growing
conviction in Galois' time that there is no formula for the roots of an arbitrary polynomial equation, and
that the classical formulae that exist for quadratics, cubics and quartics are some kind of "low degree
fluke".

(16.1) The formulae for the roots of quadratics, cubics and quartics express the roots in terms of the
coefficients, the four field operations $+, -, \times, \div$ and the roots $\sqrt{\ }, \sqrt[3]{\ }, \sqrt[4]{\ }$. When we say we want a formula for the
roots of polynomials in $\mathbb{Q}[x]$ then, it seems reasonable that it should express the roots in terms of rational
numbers, $+, -, \times, \div$ and $\sqrt[m]{\ }$ for some $m$. In particular the roots of the polynomial will be contained in
an extension of $\mathbb{Q}$ obtained by adjoining certain $m$-th roots.

With this in mind, an extension $\mathbb{Q} \subseteq E$ is called radical if and only if there is a sequence of simple
extensions,

$$\mathbb{Q} \subseteq \mathbb{Q}(\alpha_1) \subseteq \mathbb{Q}(\alpha_1, \alpha_2) \subseteq \cdots \subseteq \mathbb{Q}(\alpha_1, \alpha_2, \ldots, \alpha_k) = E,$$

such that $\alpha_i^{m_i} \in \mathbb{Q}(\alpha_1, \alpha_2, \ldots, \alpha_{i-1})$ for every $i$. Thus, each extension in the sequence is obtained by
adjoining to the previous one an $m_i$-th root of an element.

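(16.2) A simple example of a radical extension is,

$$\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{2}) \subseteq \mathbb{Q}(\sqrt{2}, \sqrt[3]{5}) \subseteq \mathbb{Q}\left(\sqrt{2}, \sqrt[3]{5}, \sqrt{\sqrt{2} - 7\sqrt[3]{5}}\right).$$

By repeatedly applying Theorem D, we see that the elements of a radical extension have expressions in
terms of rational numbers, $+, -, \times, \div$ and $\sqrt[m]{\ }$.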

(16.3) If we are looking to find a formula for the roots of a polynomial, then these roots will have precisely
this kind of expression. Thus we say that a polynomial $f \in \mathbb{Q}[x]$ is solvable by radicals if and only if
its splitting field over $\mathbb{Q}$ is contained in some radical extension.

Notice that we are dealing with a fixed specific polynomial, and not an arbitrary one. The radical 
extension containing the splitting field will depend on the polynomial. 

(16.4) Any quadratic polynomial $ax^2 + bx + c$ is solvable by radicals, with its splitting field contained
in the radical extension

$$\mathbb{Q} \subseteq \mathbb{Q}(\sqrt{b^2 - 4ac}).$$

Similarly, the formulae for the roots of cubics and quartics give, for any specific such polynomial, radical
extensions containing their splitting fields.

(16.5) Now we have a precise idea of what we mean by "finding a formula for the roots of a polynomial",
we are ready to wheel in the Galois theory. In §11 we called a group $G$ soluble if and only if there is a
sequence,

$$\{1\} = H_0 \lhd H_1 \lhd \cdots \lhd H_{n-1} \lhd H_n = G,$$

such that the successive quotients $H_1/H_0, H_2/H_1, \ldots, H_n/H_{n-1}$ are all Abelian groups.

Theorem H (Galois). A polynomial $f \in \mathbb{Q}[x]$ is solvable by radicals if and only if its Galois group
$\mathrm{Gal}(f)$ is soluble.

The proof, which we omit, uses the full power of the Galois correspondence, with the sequence of 
extensions in a radical extension corresponding to the sequence of subgroups in a soluble group. 
(16.6) Somewhat out of chronological order, we have,

Theorem 11 (Abel-Ruffini) The polynomial $x^5 - 4x + 2$ is not solvable by radicals.

Proof: Write $f = x^5 - 4x + 2$. We need to show that the Galois group $\mathrm{Gal}(f)$ is insoluble. Indeed, we show that it is the
symmetric group $S_5$, which contains the non-Abelian, finite simple group $A_5$. Thus $S_5$ contains an
insoluble subgroup, hence must be insoluble as well, as any subgroup of a soluble group is soluble by
Exercises 102 and 103. If $E$ is the splitting field over $\mathbb{Q}$ of $f$, then

$$E = \mathbb{Q}(\alpha_1, \alpha_2, \alpha_3, \alpha_4, \alpha_5),$$

where the $\alpha_i$ are the roots of $f$ and the Galois group of the polynomial is $\mathrm{Gal}(E/\mathbb{Q})$. The elements of
this group are, as usual, completely determined by where they send the $\alpha_i$, and by one of the Corollaries
to the Extension Theorem, they must be sent to roots of $f$. The conclusion is that the elements of the
Galois group of the polynomial must permute the roots amongst themselves, so that $\mathrm{Gal}(f)$ is a subgroup
of the symmetric group $S_5$.

As $\alpha_1$ has minimum polynomial $f$ over $\mathbb{Q}$ ($f$ is irreducible by Eisenstein with the prime 2), the extension $\mathbb{Q} \subseteq \mathbb{Q}(\alpha_1)$
has degree five, and then the tower law gives that

$$[E : \mathbb{Q}] = [E : \mathbb{Q}(\alpha_1)][\mathbb{Q}(\alpha_1) : \mathbb{Q}].$$

Thus, the degree of the extension $\mathbb{Q} \subseteq E$ is divisible by the degree of the
extension $\mathbb{Q} \subseteq \mathbb{Q}(\alpha_1)$, i.e. divisible by five. Moreover, by Theorem G,
the group $\mathrm{Gal}(E/\mathbb{Q})$ has order the degree $[E : \mathbb{Q}]$, thus the group has
order divisible by five. By Sylow's Theorem, this means that the Galois
group contains a subgroup of order five. The only groups of order five are the cyclic ones, and as every
element of the Galois group is already a permutation of the five roots, this subgroup must have the form,

$$\{\mathrm{id}, \sigma, \sigma^2, \sigma^3, \sigma^4\},$$

for a permutation $\sigma$ that is a 5-cycle $\sigma = (\alpha_{i_1}, \alpha_{i_2}, \alpha_{i_3}, \alpha_{i_4}, \alpha_{i_5})$. By drawing the graph of $f$ as shown,
we see that three of the $\alpha_i$ are real, and so the other two must be complex conjugates. We saw in §12 that
complex conjugation is an automorphism of $E$, and this must fix the three real roots, and interchange the
two complex ones. This gives us another automorphism $\tau$ of $E$ that as a permutation has the form,

$$\tau = (\alpha_i, \alpha_j),$$

where $\alpha_i, \alpha_j$ are the two complex roots. Since 5 is prime, a 5-cycle and a transposition together generate all of $S_5$, and so $\mathrm{Gal}(f) = S_5$. □
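
The final step of the proof uses the fact that a 5-cycle together with any transposition generates the whole of $S_5$. For a group this small the claim can simply be checked by brute force. The sketch below labels the five roots $0, \ldots, 4$ so that the 5-cycle is $0 \mapsto 1 \mapsto 2 \mapsto 3 \mapsto 4 \mapsto 0$ (an assumption that costs nothing, as the roots can be relabelled) and then tries every transposition; all function names are illustrative.

```python
from itertools import combinations


def compose(p, q):
    """The permutation 'apply p first, then q' (permutations as tuples)."""
    return tuple(q[p[i]] for i in range(len(p)))


def generated_group(gens):
    """All products of the generators (in a finite group this is the subgroup they generate)."""
    identity = tuple(range(len(gens[0])))
    seen, frontier = {identity}, [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen


def transposition(i, j, n=5):
    p = list(range(n))
    p[i], p[j] = p[j], p[i]
    return tuple(p)


five_cycle = (1, 2, 3, 4, 0)
orders = {
    (i, j): len(generated_group([five_cycle, transposition(i, j)]))
    for i, j in combinations(range(5), 2)
}
print(orders)
```

Every one of the ten transpositions gives a group of order $120 = |S_5|$.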

It is worth meditating briefly on the philosophical implications of this result, which are profound. The
Theorem says that there is no possible expression for the roots of the polynomial in terms of rational
numbers, the four field operations $+, -, \times, \div$ and roots $\sqrt[m]{\ }$ for any $m$. At first this may seem no great
problem; we know plenty of real numbers with this property, e.g. $\pi$. But the roots of the polynomial
are algebraic numbers, so there is something more, something very subtle, to the notion of an algebraic
number than it just being expressible in "algebraic terms".

(16.7) It is sometimes possible to establish the existence of numbers with special properties by counting.
For example, to explicitly show that a given real number is transcendental is complicated. If we count the
non-transcendental (i.e. the algebraic) numbers, though, we see that they are countable: they can be put in
1-1 correspondence with the integers $\mathbb{Z}$, whereas the real numbers cannot. Thus, there are many more
real numbers than algebraic ones, so transcendentals must exist, and indeed greatly outnumber the algebraics.

Such a naive approach will not work to establish the existence of the roots of equations not solvable by 
radicals, as all the sets involved now are countable. 
(16.8) So how bad is it then? We have one polynomial, $x^5 - 4x + 2$, for which no algebraic expression
exists for its roots, but is this an isolated incident, or at least one that is rare? In fact, polynomials not
solvable by radicals are generic, in the sense that polynomials that are solvable by radicals are the ones
that are relatively rare.

We can illustrate this phenomenon at least with some examples. Consider the quintic polynomials

$$x^5 + ax + b,$$

for $a, b \in \mathbb{Z}$ and in the range $-40 < a, b < 40$.



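[Figure: the $(a, b)$ plane, with both axes running from $-40$ to $40$; the legend distinguishes the polynomials that factor, those with soluble Galois group and those with insoluble Galois group.]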



The picture[16] illustrates the $(a, b)$ plane for this range of $a$ and $b$. The vertical line through $(0, 0)$ corresponds
to $f$ with $\mathrm{Gal}(f)$ the soluble dihedral group $D_{10}$ of order 10. The horizontal line through $(0, 0)$
and the two sets of crossing diagonal lines correspond to reducible $f$, as do a few other isolated points.
The (insoluble) alternating group $A_5$ arises in a few sporadic places, as does another subgroup of $S_5$.
However, the vast majority of $f$, forming the light background, have Galois group the symmetric group
$S_5$, and so have roots that are algebraic, but cannot be expressed algebraically.
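
Some of the lines of reducible polynomials are easy to reproduce: a quintic $x^5 + ax + b$ with an integer root certainly factors. The sketch below looks only at the roots $0$, $1$ and $-1$, and assumes the closed range $-40 \leq a, b \leq 40$; it makes no attempt to compute Galois groups, and the helper names are illustrative.

```python
GRID = range(-40, 41)  # assumption: integers a, b with -40 <= a, b <= 40


def quintic(a, b, x):
    """The value of x^5 + a x + b at the integer x."""
    return x ** 5 + a * x + b


def pairs_with_root(r):
    """All (a, b) in the grid for which r is a root of x^5 + a x + b."""
    return {(a, b) for a in GRID for b in GRID if quintic(a, b, r) == 0}


root_zero = pairs_with_root(0)        # expected to be the line b = 0
root_one = pairs_with_root(1)         # expected to be the line a + b = -1
root_minus_one = pairs_with_root(-1)  # expected to be the line b = a + 1
print(len(root_zero), len(root_one), len(root_minus_one))
```

The root $0$ accounts for the horizontal line $b = 0$, while the roots $\pm 1$ give a pair of crossing diagonals, $a + b = -1$ and $b = a + 1$.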



[16] The picture is based on an image from the Mathematica poster, "Solving the Quintic".
§17. Selected Solutions



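(4) If $u = \omega + \omega^{-1} = \omega + \omega^4$ then $u^2 = \omega^2 + \omega^3 + 2$, so the quadratic we want is $u^2 + u - 1 = 0$. This has roots

$$\frac{-1 \pm \sqrt{5}}{2}.$$

We get $\omega^2 - u\omega + 1 = 0$ by multiplying $u = \omega + \omega^{-1}$ through by $\omega$, hence

$$\omega = \frac{u \pm \sqrt{u^2 - 4}}{2},$$

and substitute $u$ into this to get,

$$\omega = \frac{\frac{-1 \pm \sqrt{5}}{2} \pm \sqrt{\left(\frac{-1 \pm \sqrt{5}}{2}\right)^2 - 4}}{2}.$$

(I don't expect you to actually do this!)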
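
Although doing the substitution by hand is unpleasant, it is easy to check numerically. A minimal sketch, taking the $+$ sign in both places and using the principal square root from Python's `cmath` module:

```python
import cmath
import math

u = (-1 + math.sqrt(5)) / 2              # a root of u^2 + u - 1 = 0
omega = (u + cmath.sqrt(u * u - 4)) / 2  # a root of w^2 - u w + 1 = 0

print(omega)
print(abs(omega ** 5 - 1))                          # should vanish up to rounding
print(abs(omega - cmath.exp(2j * math.pi / 5)))     # compare with cos(2 pi/5) + i sin(2 pi/5)
```

The other sign choices give the remaining primitive fifth roots of 1.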



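(6) Note that $\omega$ is a sixth root of 1 (in fact it has argument $2\pi/6$ and modulus 1) so satisfies $\omega^6 = 1$ (but $\omega^k \neq 1$ for any $k$
between 1 and 5). Clearly $\alpha\omega^2$ and $\omega^5$ are in $\mathbb{Q}(\alpha, \omega)$, so that $\mathbb{Q}(\alpha\omega^2, \omega^5) \subseteq \mathbb{Q}(\alpha, \omega)$. Conversely, $\alpha\omega^2, \omega^5 \in \mathbb{Q}(\alpha\omega^2, \omega^5) \Rightarrow
\alpha\omega^2\omega^5\omega^5 \in \mathbb{Q}(\alpha\omega^2, \omega^5)$, but $\alpha\omega^2\omega^5\omega^5 = \alpha\omega^{12} = \alpha$ since $\omega^6 = 1$. Thus $\alpha \in \mathbb{Q}(\alpha\omega^2, \omega^5)$ and hence $\alpha^{-1}\alpha\omega^2 = \omega^2$ is too,
and so finally $\omega^2\omega^5 = \omega^7 = \omega$ is as well (since $\omega^6 = 1$). Thus $\mathbb{Q}(\alpha, \omega) \subseteq \mathbb{Q}(\alpha\omega^2, \omega^5)$.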

To show that $\mathbb{Q}(\alpha, \omega) = \mathbb{Q}(\alpha\omega^4, \omega^5)$ is entirely similar.

(7) Consider the extension field $\mathbb{Q}(\alpha, \omega)$ of $\mathbb{Q}$. Note first that the solutions to $x^5 - 2$ all lie in this field, as it contains $\alpha$, $\omega$ and is
closed under multiplication.

The following paragraph is optional: $\mathbb{Q}(\alpha, \omega)$ is in fact the smallest field that contains the solutions. For, suppose that $F$ is some
field containing $\alpha, \alpha\omega, \ldots, \alpha\omega^4$. Since we are in the complex numbers,

$$n = \underbrace{1 + 1 + \cdots + 1}_{n \text{ times}} \in F$$

for any $n$. Thus, $F$ contains the rationals $\mathbb{Q}$. Also, $\alpha, \alpha\omega \in F$ gives that $\alpha$ and $\alpha^{-1}\alpha\omega = \omega \in F$ too, i.e. $F$ contains $\mathbb{Q}$, $\alpha$ and $\omega$.
But it must then contain the smallest field that does these things, i.e. $\mathbb{Q}(\alpha, \omega) \subseteq F$, and so $\mathbb{Q}(\alpha, \omega)$ really is the smallest.

So, symmetries of the solutions to $x^5 - 2$ are the rearrangements referred to in the first lecture, but of this new field $\mathbb{Q}(\alpha, \omega)$.

Looking at the picture of the pentagon, the symmetry needs to send $\alpha$ to itself and $\alpha\omega$ to $\alpha\omega^3$. This last one suggests that it
must send $\omega$ to $\omega^3$ (if the symmetry is not to disturb the $+$ and $\times$ of the field).

To see if such a symmetry exists, we need to show that the fields $\mathbb{Q}(\alpha, \omega)$ and $\mathbb{Q}(\alpha, \omega^3)$ are the same. Certainly, $\alpha, \omega^3 \in
\mathbb{Q}(\alpha, \omega)$, so that $\mathbb{Q}(\alpha, \omega^3) \subseteq \mathbb{Q}(\alpha, \omega)$. Conversely, $\omega^3 \in \mathbb{Q}(\alpha, \omega^3)$ gives $\omega^3\omega^3 = \omega^6 = \omega \in \mathbb{Q}(\alpha, \omega^3)$. We already have that $\alpha$
is there too, so $\mathbb{Q}(\alpha, \omega) \subseteq \mathbb{Q}(\alpha, \omega^3)$.

Finally, we need to check that the symmetry $\alpha \mapsto \alpha$ and $\omega \mapsto \omega^3$ does the right thing to the vertices of the pentagon. Well,
let's try $\alpha\omega^3 \mapsto \alpha(\omega^3)^3 = \alpha\omega^9 = \alpha\omega^4$. The others are entirely analogous.

(19)

1. For multiplication, let $n \neq 0$ be in $\mathbb{F}_p$. Then the gcd of $n$ and $p$ must be 1, so that for some integers $a, b$ we have
$1 = an + bp$. But then $an = (-b)p + 1 \equiv 1 \bmod p$. Thus, if $a \equiv k \bmod p$ then the inverse of $n$ is $k$.

2. Let $ab = 0$ for $a, b \in F$. Then either $a = 0$, or if $a \neq 0$ then $a^{-1}ab = a^{-1}0 \Rightarrow b = 0$. Thus, at least one of $a$ or $b$ must
be zero. Thus $F$ is an integral domain.

Let $n = rs$ with $1 < r, s < n$ integers, and consider $\mathbb{Z}_n$, the ring of integers with addition and multiplication modulo $n$.
Then $r, s \neq 0$ in $\mathbb{Z}_n$, but $rs = n \equiv 0 \bmod n$. Thus $\mathbb{Z}_n$ is not an integral domain.
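
Both parts of this solution are constructive and can be sketched in a few lines. The first function finds the integers $a, b$ with $1 = an + bp$ by the extended Euclidean algorithm and reads off the inverse; the last lists the pairs $r, s \neq 0$ with $rs \equiv 0 \bmod n$. The function names are illustrative only.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and g = s*a + t*b."""
    if b == 0:
        return a, 1, 0
    g, s, t = extended_gcd(b, a % b)
    return g, t, s - (a // b) * t


def inverse_mod(n, p):
    """The inverse of n modulo p, found from 1 = a*n + b*p."""
    g, a, _ = extended_gcd(n % p, p)
    if g != 1:
        raise ValueError("n has no inverse modulo p")
    return a % p


def zero_divisors(n):
    """Pairs (r, s) of non-zero elements of Z_n with r*s = 0 in Z_n."""
    return [(r, s) for r in range(1, n) for s in range(1, n) if (r * s) % n == 0]


print([(n, inverse_mod(n, 7)) for n in range(1, 7)])
print(zero_divisors(6))
print(zero_divisors(7))
```

For a prime modulus the list of zero divisors is empty, in line with part 2.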

(22)

1. Use $\overline{z_1 z_2} = \bar{z}_1 \bar{z}_2$ and $\overline{z_1 + z_2} = \bar{z}_1 + \bar{z}_2$ (you can easily convince yourself of these by drawing vectors in the plane).
First note that $\overline{f(z)} = f(\bar{z})$: use the two rules above with $a_n x^n + \cdots + a_0$, remembering that $a_i \in \mathbb{R}$ means $\bar{a}_i = a_i$.
Thus, $z$ is a root of $f \Leftrightarrow f(z) = 0 \Leftrightarrow \overline{f(z)} = \bar{0} = 0 \Leftrightarrow f(\bar{z}) = 0$.

2. There are many examples. The simplest is probably $x^2 - i$, since the square roots of $i$, by De Moivre's theorem, lie on the
circle $|z| = 1$; one has argument $\pi/4$, the other $5\pi/4$.



(23)

1. We have $1 = am + bn$ for some integers $a$ and $b$, hence $k = amk + bnk$. If $m$ divides $nk$ then, as it already divides $amk$,
it must also divide $k$ as required.

2. If $m/n$ is such a root, then

$$a_0 + a_1(m/n) + a_2(m/n)^2 + \cdots + a_r(m/n)^r = 0.$$

Clearing denominators gives

$$a_0 n^r + a_1 n^{r-1} m + a_2 n^{r-2} m^2 + \cdots + a_{r-1} n m^{r-1} + a_r m^r = 0.$$

Consider $n$. The number $n$ divides 0 and clearly divides every term on the left, with the possible exception of $a_r m^r$. But
since it divides everything else in sight, $n$ must also divide $a_r m^r$. Our assumption that $m$ and $n$ be coprime implies that $n$
is coprime to $m^r$, and hence by part 1 we conclude that $n \mid a_r$. Similarly, $m \mid a_0$. Finally, if $a_r = 1$, then we must have $n \in \{\pm 1\}$,
so that $m/n$ is indeed an integer.
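
Part 2 turns the search for rational roots into a finite check: only fractions $m/n$ with $m \mid a_0$ and $n \mid a_r$ need to be tried. A minimal sketch, assuming integer coefficients with $a_0$ and $a_r$ both non-zero (the name `rational_roots` is illustrative):

```python
from fractions import Fraction


def divisors(k):
    """The positive divisors of the non-zero integer k."""
    k = abs(k)
    return [d for d in range(1, k + 1) if k % d == 0]


def rational_roots(coeffs):
    """Rational roots of a_0 + a_1 x + ... + a_r x^r, given coeffs = [a_0, ..., a_r]."""
    a0, ar = coeffs[0], coeffs[-1]
    candidates = {
        Fraction(sign * m, n)
        for m in divisors(a0)
        for n in divisors(ar)
        for sign in (1, -1)
    }
    return sorted(
        c for c in candidates
        if sum(a * c ** i for i, a in enumerate(coeffs)) == 0
    )


print(rational_roots([1, -3, 2]))            # 2x^2 - 3x + 1
print(rational_roots([2, -4, 0, 0, 0, 1]))   # x^5 - 4x + 2
```

Exact arithmetic with `Fraction` avoids any rounding issues when testing the candidates.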

(26)

1. $1 + x^8$ has no real roots as $1 + x^8 \geq 1 > 0$. But, in $\mathbb{C}$ we get

$$1 + x^8 = (x - \zeta_1)(x - \bar{\zeta}_1) \cdots (x - \zeta_4)(x - \bar{\zeta}_4),$$

by the fundamental theorem of algebra. Thus,

$$1 + x^8 = (x^2 - (\zeta_1 + \bar{\zeta}_1)x + \zeta_1\bar{\zeta}_1) \cdots (x^2 - (\zeta_4 + \bar{\zeta}_4)x + \zeta_4\bar{\zeta}_4),$$

and these are real polynomials since the sum and product of a complex number with its conjugate are real. Thus $1 + x^8$ is
reducible over $\mathbb{R}$ as a product of four quadratics.

2. Following the hint, 1 is clearly a root of $y^n - 1$, and we get

$$y^n - 1 = (y - 1)(1 + y + y^2 + \cdots + y^{n-1}),$$

so letting $y = x^2$ and $n = 6$ gives

$$(1 + x^2 + x^4 + x^6 + x^8 + x^{10})(x^2 - 1) = x^{12} - 1.$$

Thus the roots of the right hand side are the 12th roots of 1. Hence the roots of the left hand side are also the 12th roots of 1.
Now $\pm 1$ are certainly the roots of $x^2 - 1$, and these are two of the 12th roots, so the other ten are the roots of the polynomial
that we are interested in. Notice that they are all in $\mathbb{C}$, but two of them are $\pm i$, hence

$$(x - i)(x + i)$$

are factors of $1 + x^2 + x^4 + x^6 + x^8 + x^{10}$, i.e. $x^2 + 1$ is a factor of $1 + x^2 + x^4 + x^6 + x^8 + x^{10}$ (notice that this
argument, while more complicated than others maybe, works for $1 + x^2 + \cdots + x^{2n}$). Having got that far, it's then pretty
easy to spot that

$$1 + x^2 + x^4 + x^6 + x^8 + x^{10} = (x^2 + 1)(1 + x^4 + x^8),$$

so that the polynomial is reducible over $\mathbb{Q}$. Can you generalise the argument to handle $1 + x^2 + \cdots + x^{2n}$?

3. The polynomial has value 7 when $x = 0$ and value $-7$ when $x = -1$, hence by the intermediate value theorem, there must
be a real root somewhere between $-1$ and 0 (polynomials are continuous, so the graph must cross the $x$-axis!) Hence we
have a linear factor and thus the polynomial is reducible.

4. The polynomial has integer coefficients, 2 divides all of them except that of the leading term and $2^2 = 4$ does not divide the
constant term. Thus, by Eisenstein, the polynomial is irreducible over $\mathbb{Q}$.

5. We are dealing with a quadratic, so irreducibility becomes a matter of merely checking for roots. In $\mathbb{Z}_7$, no element squared
plus one is equal to zero, so the polynomial is irreducible.

6. It looks complicated, but we have a cubic, and that means again that all we need do is check for roots, this time in the field
$F$ of order eight of §4. In fact, we've gone to all the trouble of writing the $+$ and $\times$ tables out, so we may as well use them!
Somewhat disappointingly, 1 turns out to be a root, so the polynomial is reducible straight away. We can say a little more
(although this is not necessary) since no other element of $F$ is a root, and therefore the cubic must factorise into a product of
a linear and a quadratic factor. If you were sufficiently interested, you could find the irreducible quadratic by long division
(which works in exactly the same way as long division in the reals, since we are still in a field!)
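
The two factorisations used in part 2 are easy to confirm by multiplying out. A small sketch, with polynomials stored as coefficient lists in increasing degree (the helper `poly_mul` is illustrative):

```python
def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists [c_0, c_1, ...]."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


even_sum = [1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1]  # 1 + x^2 + x^4 + ... + x^10

# (x^2 + 1)(1 + x^4 + x^8)
product = poly_mul([1, 0, 1], [1, 0, 0, 0, 1, 0, 0, 0, 1])
# (x^2 - 1)(1 + x^2 + ... + x^10)
twelfth = poly_mul([-1, 0, 1], even_sum)

print(product == even_sum)
print(twelfth)
```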

(27) We have that

$$\binom{p}{i} = \frac{p!}{i!(p - i)!} = m$$

for $m$ an integer, as there are clearly an integral number of ways of choosing $i$ objects from $p$. Thus,

$$p! = m\, i!(p - i)!$$

so that as $p$ divides the left hand side, it also divides $m\, i!(p - i)!$. As $i < p$, we can't have $p$ dividing $i$ or any integer less than it,
hence $p$ does not divide $i!$. Similarly $p$ doesn't divide $(p - i)!$ and so it must divide $m$ (all of which uses the fact that $p$ is prime).

(29)

1. Clearly $\omega$ is a root of $x^n - 1$ and

$$x^n - 1 = (x - 1)(x^{n-1} + x^{n-2} + \cdots + x + 1),$$

and $\omega$ is not a root of $x - 1$ (since $\omega$, being primitive, is not 1), so must be a root of the desired polynomial.

irreducible only if $n$ is odd.

3. If $f(x) = g(x)h(x)$, then obviously $f(x + 1) = g(x + 1)h(x + 1)$, contradicting the irreducibility of $f(x + 1)$. Thus
$f(x)$ is irreducible too.

4. We know that

$$\binom{p}{i} = \frac{p!}{i!(p - i)!}$$

is an integer $m$, so $p(p - 1)! = m\, i!(p - i)!$, hence $p$ divides one of $m$, $i!$ or $(p - i)!$. If $p \mid i!$ then it divides a $j$ with $1 \leq j \leq i < p$,
which cannot be. Similarly, $p$ cannot divide $(p - i)!$, and thus $p$ divides $m$.

5. By the above,

$$x^p - 1 = (x - 1)\Phi_p(x) \Rightarrow \Phi_p(x) = \frac{x^p - 1}{x - 1} \Rightarrow \Phi_p(x + 1) = \frac{(x + 1)^p - 1}{x},$$

and using the binomial theorem and cancelling we get

$$\Phi_p(x + 1) = x^{p-1} + p x^{p-2} + \cdots + \binom{p}{i} x^{p-i-1} + \cdots + p.$$

Using Eisenstein, with $p$ as the prime, gives $\Phi_p(x + 1)$ irreducible by part 4, and hence $\Phi_p(x)$ too by part 3.
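
For small primes the Eisenstein condition on $\Phi_p(x + 1)$ can be checked directly. In this sketch the coefficients are read off from the binomial expansion above, so that the coefficient of $x^j$ is $\binom{p}{j+1}$; `math.comb` needs Python 3.8 or later, and the function names are illustrative.

```python
from math import comb


def shifted_cyclotomic(p):
    """Coefficients [c_0, ..., c_{p-1}] of Phi_p(x + 1) = ((x + 1)^p - 1) / x."""
    # (x + 1)^p - 1 = sum over k = 1..p of C(p, k) x^k; dividing by x shifts k to k - 1.
    return [comb(p, j + 1) for j in range(p)]


def eisenstein(coeffs, q):
    """Eisenstein's criterion at the prime q for coeffs = [c_0, ..., c_r]."""
    *lower, lead = coeffs
    return (
        lead % q != 0
        and all(c % q == 0 for c in lower)
        and lower[0] % (q * q) != 0
    )


for p in (3, 5, 7, 11, 13):
    print(p, shifted_cyclotomic(p), eisenstein(shifted_cyclotomic(p), p))
```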



(40)

1. $(1 + x + x^2) + (1 + x) = x^2$ (since $1 + 1 = 0 = x + x$ in $\mathbb{Z}_2$ arithmetic). Similarly, $(1 + x + x^2)(1 + x) =
1 + x + x^2 + x + x^2 + x^3 = 1 + x^3 = 1 + 1 + x = x$ (using the rule $x^3 = x + 1$).

2. $F = \{0, 1, x, x + 1, x^2, x^2 + 1, x^2 + x, x^2 + x + 1\}$, so $F$ has eight elements.

3. The tables are (somewhat tediously!)

+        | 0        1        x        x+1      x^2      x^2+1    x^2+x    x^2+x+1
---------+------------------------------------------------------------------------
0        | 0        1        x        x+1      x^2      x^2+1    x^2+x    x^2+x+1
1        | 1        0        x+1      x        x^2+1    x^2      x^2+x+1  x^2+x
x        | x        x+1      0        1        x^2+x    x^2+x+1  x^2      x^2+1
x+1      | x+1      x        1        0        x^2+x+1  x^2+x    x^2+1    x^2
x^2      | x^2      x^2+1    x^2+x    x^2+x+1  0        1        x        x+1
x^2+1    | x^2+1    x^2      x^2+x+1  x^2+x    1        0        x+1      x
x^2+x    | x^2+x    x^2+x+1  x^2      x^2+1    x        x+1      0        1
x^2+x+1  | x^2+x+1  x^2+x    x^2+1    x^2      x+1      x        1        0

Now, $(F, +)$ is an Abelian group for the following reasons: the table closes up (we don't get anything new and unexpected),
so the field is closed under $+$; the first row is identical to the indexing along the top, so 0 is the identity under $+$; each row
contains 0 somewhere in it, so inverses exist for all elements. Unfortunately, associativity isn't quite so easily established!



Similarly,

×        | 0        1        x        x+1      x^2      x^2+1    x^2+x    x^2+x+1
---------+------------------------------------------------------------------------
0        | 0        0        0        0        0        0        0        0
1        | 0        1        x        x+1      x^2      x^2+1    x^2+x    x^2+x+1
x        | 0        x        x^2      x^2+x    x+1      1        x^2+x+1  x^2+1
x+1      | 0        x+1      x^2+x    x^2+1    x^2+x+1  x^2      1        x
x^2      | 0        x^2      x+1      x^2+x+1  x^2+x    x        x^2+1    1
x^2+1    | 0        x^2+1    1        x^2      x        x^2+x+1  x+1      x^2+x
x^2+x    | 0        x^2+x    x^2+x+1  1        x^2+1    x+1      x        x^2
x^2+x+1  | 0        x^2+x+1  x^2+1    x        1        x^2+x    x^2      x+1

shows that $(F \setminus \{0\}, \times)$ is an Abelian group. The distributive law is also a bit tedious!
Finally,

$$\frac{1}{1 + x} = x^2 + x \quad\text{and}\quad \frac{1}{1 + x + x^2} = x^2 \quad\text{in } F,$$

from the tables.
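
The tedium of the tables can be handed to a machine. In the sketch below an element $c_2x^2 + c_1x + c_0$ of $F$ is encoded as the three-bit integer with binary digits $c_2c_1c_0$ (an encoding chosen here for convenience, not one used in the notes), so that addition is bitwise exclusive-or and multiplication is reduced using the rule $x^3 = x + 1$.

```python
MODULUS = 0b1011  # the polynomial x^3 + x + 1


def add(a, b):
    """Addition in F: coefficientwise addition modulo 2."""
    return a ^ b


def mul(a, b):
    """Multiplication in F, reducing with x^3 = x + 1 after each shift."""
    prod = 0
    while b:
        if b & 1:
            prod ^= a
        a <<= 1              # multiply a by x
        if a & 0b1000:       # an x^3 term appeared: replace it by x + 1
            a ^= MODULUS
        b >>= 1
    return prod


def name(a):
    """Human-readable form of the element encoded by a."""
    terms = [t for bit, t in ((4, "x^2"), (2, "x"), (1, "1")) if a & bit]
    return "+".join(terms) or "0"


for a in range(8):
    print(name(a).ljust(8), "|", " ".join(name(mul(a, b)).ljust(8) for b in range(8)))

inverses = {name(a): name(b) for a in range(1, 8) for b in range(1, 8) if mul(a, b) == 1}
print(inverses)
```

The dictionary of inverses reproduces the two quotients read off from the tables, namely $1/(1 + x) = x^2 + x$ and $1/(1 + x + x^2) = x^2$.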



(47)

1. We certainly have that $(0)$ is an ideal. Suppose that $(\lambda)$ is another one with $\lambda \neq 0$. For any $\mu \in F$ we have that $\mu = \mu\lambda^{-1}\lambda$
(the inverse of $\lambda$ existing as $F$ is a field) so that $\mu$ is a multiple of $\lambda$ and hence in the ideal $(\lambda)$. Thus $(\lambda) = F$. The conclusion
is that $F$ contains only the two ideals $(0)$ and $F$.

2. We need only show that every non-zero element of $R$ has an inverse under multiplication. Let $r \neq 0$ be such an element and
consider the ideal $(r)$. By the restriction on the possible ideals we have either $(r) = (0) = \{0\}$ or $(r) = R$. As $r \in (r)$
the first one cannot happen, so that it is $(r) = R$ that we have. In particular $1 \in (r)$, i.e. there is an $s \in R$ such that $sr = 1$
and by commutativity we also have that $rs = 1$. Thus $s$ is the inverse of $r$ as required.
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(61) 

1. The polynomial has no roots in F3, so is irreducible as it is a cubic. The quotient ring given is then a field. 

2. We use the division algorithm: 

g(x) + (1 - x + x 3 ) = (q(x)(l - x + x 3 ) + (a + bx + cx 2 )) + (1 - x + x 3 ) = (a + bx + cx 2 ) + (1 - x + x 3 ), 

for any coset g(x) + (1 — x + x 3 ). The uniqueness follows from the fact that the quotient and remainder are uniquely 
determined by the division algorithm (See the first handout on rings). 

3. There are three choices for each of a, b and c in (a + bx + cx 2 ) + (1 — x + x 3 ), so that the field has at most 27 elements. 
On the other hand, suppose 

(ai + bix + ax 2 ) + (f - x + x 3 ) = (a 2 + b 2 x + c 2 x 2 ) + (1 - x + x 3 }, 

for some a; , ft; , c; with i = 1,2. Then, using some of the basic properties of cosets described in the lectures, we get that, 

((ai-a2)+(bi~b2)x+(ci~C2)x 2 )+{l~x+x 3 ) = (1-x+x 3 ) => (ai-a2)+(bi~b2)x+(c 1 -C2)x 2 e (1-x+x 3 ) => 

(the last since two cosets the same means the difference of the representative polynomials is a multiple of f(x)). But the 
degree of (ai — a 2 ) + (fti — 62)^ + (ci — c 2 )x 2 is two, while every muliple of 1 — x + x 3 has degree 3 or more, except 
for one: the zero polynomial. Thus (ai — 02) + (fti — b 2 )x + (ci — c 2 )x 2 must be the zero polynomial, ie a± = 0,2, 
fti = &2 and ci = C2. Thus all the 27 cosets listed are different (there really are 27 of them!) 
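Both counting claims in this solution are small enough to confirm by brute force. The sketch below is only an illustration: it represents $\mathbb{F}_3$ by the integers 0, 1, 2 and a remainder $a + bx + cx^2$ by the triple `(a, b, c)`; the names `cubic`, `roots` and `remainders` are invented here.

```python
from itertools import product

def cubic(x):
    """Evaluate 1 - x + x^3 modulo 3."""
    return (1 - x + x**3) % 3

# A cubic with no roots in F_3 has no linear factor, hence is irreducible.
roots = [x for x in range(3) if cubic(x) == 0]

# Each coset has exactly one representative a + b*x + c*x^2 with a, b, c in F_3.
remainders = list(product(range(3), repeat=3))
```

Here `roots` comes out empty and `remainders` has $3^3 = 27$ entries, matching the count of cosets.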

(69) Notice that one of the 5-th roots of 1 found in the first question is (after a little massaging) equal to

$$\omega = \frac{\sqrt5 - 1}{4} + \frac{\sqrt2\sqrt{5 + \sqrt5}}{4}\, i.$$

In fact, this is the first vertex anticlockwise around the circle from 1. Now, this number is constructible precisely when its real and imaginary parts,

$$\frac{\sqrt5 - 1}{4} \quad\text{and}\quad \frac{\sqrt2\sqrt{5 + \sqrt5}}{4},$$

are constructible. But these last two numbers can be obtained from integers using the four field operations and by taking square roots, all of which we can do with ruler and compass. Thus $\omega$ is constructible, hence so is the desired pentagon, just by stepping off the length of the line segment joining 1 to $\omega$ using your compass.

(70) If the length of the line segment is x, then the task is to construct ^ and ^ . We can certainly construct i and | using our 
ruler and compass, and we can multiply lengths using these two tools as well. So, multiply the two fractions by the line segment, 
and we're done (notice that we can use this argument to $n$-sect a line for any $n$, i.e. divide it into $n$ equal parts).

(72) The best way to do this part is to use a picture proof that can be run in both directions. Alternatively, one can write out a solution in terms of words, and since this is easier to LaTeX, I'll do it that way.

If $\theta$ is constructible, then assuming without loss of generality that one side of the angle is the $x$-axis, the intersection of the other side with the unit circle is the point $(\cos\theta, \sin\theta)$. Dropping a vertical line to the $x$-axis gives us $\cos\theta$. Conversely, if $\cos\theta$ is constructible, then so is $\sqrt{1 - \cos^2\theta}$, as the field of (complex) constructible numbers is closed under taking of square roots. Hence $\sin\theta$ can be constructed up the $y$-axis, and horizontal and vertical lines determine $(\cos\theta, \sin\theta)$, and a line through this point and the origin constructs the angle $\theta$.

(76) We begin by computing $(1 + \alpha)^{-1}$. According to the lemma on the structure of simple algebraic extensions, this element must have the form $b\alpha^2 + c\alpha + d$ for some uniquely defined $b$, $c$, and $d$ in $\mathbb{Q}$. So we simply need to solve the equation

$$(\alpha + 1)(b\alpha^2 + c\alpha + d) = 1$$

for these rational coefficients. Expanding, we find that

$$b\alpha^3 + (b + c)\alpha^2 + (c + d)\alpha + d = 1,$$

so that we have three equations for $b$, $c$, and $d$, namely:

$$b + c = 0, \qquad c + d = 0, \qquad 2b + d = 1$$

(where in the last equation we've used the fact that $\alpha^3 = 2$).

Now, the first two equations imply that $b = -c = d$, and hence from the third equation we see that $3b = 1$. Hence, $b = 1/3$, $c = -1/3$, and $d = 1/3$, so that the required inverse for $(\alpha + 1)$ is

$$(\alpha + 1)^{-1} = \tfrac{1}{3}(\alpha^2 - \alpha + 1).$$

(If you like, you can check your answer by multiplying by $(\alpha + 1)$.)
In a similar manner, we can compute that

$$(\alpha^2 + 1)^{-1} = \tfrac{1}{5}(-\alpha^2 + 2\alpha + 1),$$

so that, since $\alpha^4 + 1 = 2\alpha + 1$, we have that

$$(\alpha^4 + 1)(\alpha^2 + 1)^{-1} = (2\alpha + 1)\,\tfrac{1}{5}(-\alpha^2 + 2\alpha + 1) = \cdots = \tfrac{1}{5}(3\alpha^2 + 4\alpha - 3).$$
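The suggestion to check the answer by multiplying out can be carried out exactly with rational arithmetic. This is a minimal sketch, assuming elements of $\mathbb{Q}(\alpha)$ are stored as coefficient lists (lowest degree first) and reduced using $\alpha^3 = 2$; the helper name `mul_mod` is hypothetical.

```python
from fractions import Fraction

def mul_mod(p, q, modulus):
    """Multiply coefficient lists p and q (lowest degree first) modulo a monic polynomial."""
    n = len(modulus) - 1
    prod = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            prod[i + j] += Fraction(a) * Fraction(b)
    for k in range(len(prod) - 1, n - 1, -1):   # eliminate x^k for k >= n
        c = prod[k]
        for j in range(n + 1):
            prod[k - n + j] -= c * modulus[j]
    prod = prod[:n]
    return prod + [Fraction(0)] * (n - len(prod))

CUBIC = [-2, 0, 0, 1]                           # x^3 - 2, so alpha^3 = 2

inv_a_plus_1 = [Fraction(1, 3), Fraction(-1, 3), Fraction(1, 3)]
inv_a2_plus_1 = [Fraction(1, 5), Fraction(2, 5), Fraction(-1, 5)]

check_first = mul_mod([1, 1, 0], inv_a_plus_1, CUBIC)         # (1 + alpha) * inverse
check_second = mul_mod([1, 0, 1], inv_a2_plus_1, CUBIC)       # (1 + alpha^2) * inverse
quotient = mul_mod([1, 0, 0, 0, 1], inv_a2_plus_1, CUBIC)     # (alpha^4 + 1)/(alpha^2 + 1)
```

Both `check_first` and `check_second` reduce to the element 1, and `quotient` holds the coefficients of $\tfrac{1}{5}(3\alpha^2 + 4\alpha - 3)$ in the order constant, $\alpha$, $\alpha^2$.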
(77) The minimum polynomial for $\alpha = \sqrt[3]{5}$ over $\mathbb{Q}$ is $x^3 - 5$, so every element of $\mathbb{Q}(\alpha)$ has the form $a + b\alpha + c\alpha^2$ for a unique choice of $a$, $b$ and $c$ in $\mathbb{Q}$. In each case we explicitly use the fact that $\alpha^3 = 5$.

1. 

2. This is really easy: $\alpha^5 - \alpha^6 = 5\alpha^2 - 25$.

3. Set $\alpha/(\alpha^2 + 1) = a + b\alpha + c\alpha^2$, and solve for $a$, $b$ and $c$ in $\mathbb{Q}$. We multiply through by $(\alpha^2 + 1)$ and see that

$$\alpha = (\alpha^2 + 1)(a + b\alpha + c\alpha^2) = (a + 5b) + (b + 5c)\alpha + (a + c)\alpha^2,$$

so that equating coefficients reveals the equations

$$a + 5b = 0, \qquad b + 5c = 1, \qquad a + c = 0.$$

The last equation forces $a = -c$, and substituting in, we're left with the two equations $a + 5b = 0$ and $b - 5a = 1$. So $a = -5b$ and $c = 5b$, and the middle equation becomes $26b = 1$. Hence

$$\alpha/(\alpha^2 + 1) = (-5/26) + (1/26)\alpha + (5/26)\alpha^2.$$



(78) 



Let $\alpha = \sqrt2 + \sqrt{-2} = \sqrt2 + i\sqrt2$. Then $\alpha^2 = 4i$, so that $\alpha^4 = -16$. Consider the polynomial $f(x) = x^4 + 16$. This is monic and has $\alpha$ as a root. Moreover, we claim that this is irreducible and hence is the correct minimum polynomial. First note that $f(x)$ has no roots in $\mathbb{Q}$, because $f(a) \geq 16$ for all $a$ in $\mathbb{Q}$. We still need to show that $f(x)$ does not factor as a product of quadratics, and this is best done by drawing the roots in the complex plane, being

2 2 

Any factorisation into quadratics of this polynomial would come from multiplying two terms of the form

$$(x - \zeta_1)(x - \zeta_2),$$

but these never give polynomials with rational coefficients. For the rest, we use the fact that $\alpha^4 = -16$. Note that it's not at all obvious that the first three numbers actually lie in $\mathbb{Q}(\alpha)$.

For the first part, we begin by showing that $\sqrt2 \in \mathbb{Q}(\alpha)$. [We could just solve as we did in part (c) of problem 3, but hopefully the method we're using will produce the answer almost immediately.] We compute that, in addition to the above information,

$$\alpha^3 = (\sqrt2 + i\sqrt2)(4i) = -4\sqrt2 + 4i\sqrt2.$$

Hence it's easy to see that $\sqrt2 = (1/8)(4\alpha - \alpha^3)$.

For exactly the same reason, $\sqrt{-2} = i\sqrt2 = (1/8)(4\alpha + \alpha^3)$.

This is the easiest of the lot: $i = (1/4)\alpha^2$.

Since $\alpha^4 = -16$, we have

$$\alpha^5 + 4\alpha + 3 = -16\alpha + 4\alpha + 3 = 3 - 12\alpha.$$

We need to find a polynomial $g(\alpha)$ in $\alpha$ which satisfies $\alpha g(\alpha) = 1$. Since $\alpha^4 = -16$, we have $(-1/16)\alpha^4 = 1$, which means that $1/\alpha = (-1/16)\alpha^3$.

This last is the most complicated. We set $(2\alpha + 3)/(\alpha^2 + 2\alpha + 2) = a + b\alpha + c\alpha^2 + d\alpha^3$, and solve for $a$, $b$, $c$ and $d$ in $\mathbb{Q}$. We multiply through by $(\alpha^2 + 2\alpha + 2)$ and see that

$$2\alpha + 3 = (\alpha^2 + 2\alpha + 2)(a + b\alpha + c\alpha^2 + d\alpha^3) = (2a - 16c - 32d) + (2a + 2b - 16d)\alpha + (a + 2b + 2c)\alpha^2 + (b + 2c + 2d)\alpha^3,$$

so that equating coefficients reveals the equations

$$\begin{aligned}
2a - 16c - 32d &= 3 \\
2a + 2b - 16d &= 2 \\
a + 2b + 2c &= 0 \\
b + 2c + 2d &= 0
\end{aligned}$$

Hence we have 4 equations in 4 unknowns and can solve to find $a = -1/2$, $b = 1/6$, $c = 1/12$ and $d = -1/6$, and so

$$(2\alpha + 3)/(\alpha^2 + 2\alpha + 2) = -1/2 + (1/6)\alpha + (1/12)\alpha^2 - (1/6)\alpha^3.$$
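The four equations in four unknowns can be solved exactly by elimination over the rationals. The following is a sketch rather than the method used in the notes: `solve` is a hypothetical Gauss-Jordan helper, and the unknowns are ordered $(a, b, c, d)$.

```python
from fractions import Fraction

def solve(matrix, rhs):
    """Solve a square, non-singular linear system exactly by Gauss-Jordan elimination."""
    n = len(matrix)
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(matrix, rhs)]
    for col in range(n):
        # Pick a row with a nonzero entry in this column and move it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]

# Coefficients of (a, b, c, d) in the four equations above.
system = [
    [2, 0, -16, -32],
    [2, 2, 0, -16],
    [1, 2, 2, 0],
    [0, 1, 2, 2],
]
solution = solve(system, [3, 2, 0, 0])
```

The list `solution` then holds $a, b, c, d$ as exact fractions, which can be compared with the values quoted in the text.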



(79)

1. Let $\alpha = 1 + i$. Then $\alpha^2 = 2i$, so that $\alpha^2 - 2\alpha = -2$. In particular, $\alpha$ is a root of the polynomial $f(x) = x^2 - 2x + 2$ over $\mathbb{Q}$. This is also monic, so we just need to decide whether or not $f(x)$ is irreducible over $\mathbb{Q}$. However, the only way that $f(x)$ could be reducible is if it factored as $f(x) = (x - a)(x - b)$ with both $a$ and $b$ in $\mathbb{Q}$. But we know that one of these roots would necessarily be $\alpha$, which is not in $\mathbb{Q}$, so $f(x)$ is indeed irreducible. Hence $f(x)$ is the required minimum polynomial.

2. The obvious polynomial to try this time is $g(x) = x^3 - 7$, since it's monic and clearly has $\sqrt[3]{7}$ as a root. Moreover, Eisenstein's criterion immediately applies with $q = 7$ and we see that $g(x)$ is irreducible over $\mathbb{Q}$. Hence $g(x)$ is the required minimum polynomial.

3. Just as in the previous part, the minimum polynomial for $\sqrt[4]{5}$ over $\mathbb{Q}$ is $x^4 - 5$.
4. This time there's really no obvious choice, so we need to consider relations between powers of $\alpha = \sqrt2 + i$:

$$\begin{aligned}
\alpha &= \sqrt2 + i \\
\alpha^2 &= 1 + 2i\sqrt2 \\
\alpha^3 &= \sqrt2 + i + 4i - 2\sqrt2 = -\sqrt2 + 5i \\
\alpha^4 &= (1 + 2i\sqrt2)^2 = 1 - 8 + 4i\sqrt2 = -7 + 4i\sqrt2
\end{aligned}$$

So we notice without too much trouble that $\alpha^4 - 2\alpha^2 = -9$, so that $\alpha$ is a root of the monic polynomial $h(x) = x^4 - 2x^2 + 9$. We need to check irreducibility of $h(x)$ over $\mathbb{Q}$. Equivalently, we can work over $\mathbb{Z}$, and begin by checking for linear factors. For any root $a$ of $h(x)$ in $\mathbb{Z}$, we must have $a \mid 9$. It's easy to see that none of $\pm1, \pm3, \pm9$ is a root. So $h(x)$ has no linear factors. Suppose we try to factorise $h(x)$ as

$$x^4 - 2x^2 + 9 = (x^2 + ax + b)(x^2 + cx + d) = x^4 + (a + c)x^3 + (ac + b + d)x^2 + (ad + bc)x + bd.$$

Equating coefficients, we conclude that $a = -c$, and hence that $0 = ad + bc = a(d - b)$. So either $a = 0$ or $d = b$. First assume that $a = 0$, and hence $c = 0$. The remaining equations then imply that $bd = 9$ and $b + d = -2$, and these equations have no solution in $\mathbb{Z}$.

So we must therefore have $a \neq 0$, and hence $d = b$. In that case the constant coefficient is $b^2 = 9$. This forces $b$ to be 3 or $-3$. Consider now the remaining equation, which can be written as $b + d = a^2 - 2$, or $2b = a^2 - 2$. Hence this implies that $a^2 = 2b + 2$. But with our choices for $b$, we then have $a^2 = 8$ or $a^2 = -4$, neither of which has a solution in $\mathbb{Z}$. Therefore, $h(x)$ is irreducible over $\mathbb{Q}$ and hence is the required minimum polynomial.

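5. One way to do this part is to argue exactly as we did in part (d). We set $\alpha = \sqrt2 + \sqrt[3]{3}$ and compute powers of $\alpha$, looking for a relationship. If we try this, we compute powers up to $\alpha^6$ and compare coefficients of $1, \alpha, \ldots, \alpha^6$ to get 6 equations in 7 unknowns,

$$\begin{aligned}
1 &= 1 \\
\alpha &= \sqrt2 + \sqrt[3]{3} \\
\alpha^2 &= 2 + 2\sqrt2\sqrt[3]{3} + (\sqrt[3]{3})^2 \\
\alpha^3 &= 3 + 2\sqrt2 + 6\sqrt[3]{3} + 3\sqrt2(\sqrt[3]{3})^2 \\
\alpha^4 &= 4 + 12\sqrt2 + 3\sqrt[3]{3} + 8\sqrt2\sqrt[3]{3} + 12(\sqrt[3]{3})^2 \\
\alpha^5 &= 60 + 4\sqrt2 + 20\sqrt[3]{3} + 15\sqrt2\sqrt[3]{3} + 3(\sqrt[3]{3})^2 + 20\sqrt2(\sqrt[3]{3})^2 \\
\alpha^6 &= 17 + 120\sqrt2 + 90\sqrt[3]{3} + 24\sqrt2\sqrt[3]{3} + 60(\sqrt[3]{3})^2 + 18\sqrt2(\sqrt[3]{3})^2
\end{aligned}$$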

and hence the matrix

$$\begin{pmatrix}
1 & 0 & 2 & 3 & 4 & 60 & 17 \\
0 & 1 & 0 & 2 & 12 & 4 & 120 \\
0 & 1 & 0 & 6 & 3 & 20 & 90 \\
0 & 0 & 2 & 0 & 8 & 15 & 24 \\
0 & 0 & 1 & 0 & 12 & 3 & 60 \\
0 & 0 & 0 & 3 & 0 & 20 & 18
\end{pmatrix}.$$

Row-reduction yields:

$$\begin{pmatrix}
1 & 0 & 2 & 3 & 4 & 60 & 17 \\
0 & 1 & 0 & 2 & 12 & 4 & 120 \\
0 & 0 & 2 & 0 & 8 & 15 & 24 \\
0 & 0 & 0 & 4 & -9 & 16 & -30 \\
0 & 0 & 0 & 0 & 8 & -\tfrac{9}{2} & 48 \\
0 & 0 & 0 & 0 & 0 & \tfrac{755}{64} & 0
\end{pmatrix}.$$

Back-substitution finally yields the polynomial

$$f(x) = x^6 - 6x^4 - 6x^3 + 12x^2 - 36x + 1,$$

which is monic and has $\alpha$ as a root.

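Another approach which leads to the same polynomial is to begin with the equation $\alpha = \sqrt2 + \sqrt[3]{3}$ and eliminate radicals. So write

$$\alpha - \sqrt2 = \sqrt[3]{3}.$$

Cubing gives

$$\begin{aligned}
(\alpha - \sqrt2)^3 &= 3 \\
\alpha^3 - 3\sqrt2\,\alpha^2 + 6\alpha - 2\sqrt2 &= 3 \\
(\alpha^3 + 6\alpha) - \sqrt2(3\alpha^2 + 2) &= 3 \\
\sqrt2(3\alpha^2 + 2) &= \alpha^3 + 6\alpha - 3,
\end{aligned}$$

so that squaring gives

$$\begin{aligned}
2(3\alpha^2 + 2)^2 &= (\alpha^3 + 6\alpha - 3)^2 \\
2(9\alpha^4 + 12\alpha^2 + 4) &= \alpha^6 + 36\alpha^2 + 9 + 12\alpha^4 - 6\alpha^3 - 36\alpha.
\end{aligned}$$

Rearranging gives

$$\alpha^6 - 6\alpha^4 - 6\alpha^3 + 12\alpha^2 - 36\alpha + 1 = 0,$$

which yields the same polynomial.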
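It remains to check that

$$f(x) = x^6 - 6x^4 - 6x^3 + 12x^2 - 36x + 1$$

is irreducible over $\mathbb{Q}$. Reducing mod 3 does not settle this: it gives

$$\bar f(x) = x^6 + 1 = (x^2 + 1)^3 \quad\text{in } \mathbb{F}_3[x],$$

which has no roots in $\mathbb{F}_3$ but is far from irreducible. Instead we compare degrees. Cubing $\alpha - \sqrt2 = \sqrt[3]{3}$ gives $\sqrt2(3\alpha^2 + 2) = \alpha^3 + 6\alpha - 3$, so that $\sqrt2 = (\alpha^3 + 6\alpha - 3)/(3\alpha^2 + 2) \in \mathbb{Q}(\alpha)$, and hence also $\sqrt[3]{3} = \alpha - \sqrt2 \in \mathbb{Q}(\alpha)$. Thus $\mathbb{Q}(\alpha) = \mathbb{Q}(\sqrt2, \sqrt[3]{3})$.

By the Tower Law, $[\mathbb{Q}(\sqrt2, \sqrt[3]{3}) : \mathbb{Q}]$ is divisible by both $[\mathbb{Q}(\sqrt2) : \mathbb{Q}] = 2$ and $[\mathbb{Q}(\sqrt[3]{3}) : \mathbb{Q}] = 3$, and it is at most 6, so it equals 6. Hence the minimum polynomial of $\alpha$ over $\mathbb{Q}$ has degree 6. Since $f(x)$ is monic of degree 6 and has $\alpha$ as a root, $f(x)$ is that minimum polynomial, and in particular $f(x)$ is irreducible.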
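The power computations for $\alpha = \sqrt2 + \sqrt[3]{3}$ are error-prone by hand, so here is a small exact check. It is a sketch under the assumption that an element of $\mathbb{Q}(\sqrt2, \sqrt[3]{3})$ is stored as a $2 \times 3$ grid `u[i][j]` of coefficients of $(\sqrt2)^i(\sqrt[3]{3})^j$; the function `mul` and the variable names are illustrative only.

```python
def mul(u, v):
    """Multiply two elements; u[i][j] is the coefficient of sqrt(2)^i * cbrt(3)^j."""
    out = [[0, 0, 0], [0, 0, 0]]
    for i in range(2):
        for j in range(3):
            for k in range(2):
                for l in range(3):
                    c = u[i][j] * v[k][l]
                    if i + k == 2:      # sqrt(2)^2 = 2
                        c *= 2
                    if j + l >= 3:      # cbrt(3)^3 = 3
                        c *= 3
                    out[(i + k) % 2][(j + l) % 3] += c
    return out

one = [[1, 0, 0], [0, 0, 0]]
alpha = [[0, 1, 0], [1, 0, 0]]          # cbrt(3) + sqrt(2)

powers = [one]
for _ in range(6):
    powers.append(mul(powers[-1], alpha))

# f(x) = x^6 - 6x^4 - 6x^3 + 12x^2 - 36x + 1, lowest degree first.
coeffs = [1, -36, 12, -6, -6, 0, 1]
f_alpha = [
    [sum(c * p[i][j] for c, p in zip(coeffs, powers)) for j in range(3)]
    for i in range(2)
]
```

Each `powers[k]` lists the six basis coefficients of $\alpha^k$, and `f_alpha` is the zero grid exactly when $f(\alpha) = 0$.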

To see that it is, note that since $\sqrt{11}$ is irrational, so is $\alpha$. [If you don't believe this, you can see this as follows: if $\alpha = m/n \in \mathbb{Q}$, then we would have $\sqrt{11} = (2m + 3n)/n$, contradicting the irrationality of $\sqrt{11}$.] (That $\sqrt{11}$ is irrational follows from the fact that the polynomial $x^2 - 11$ has roots that are either integers or irrational, by a question from the first assignment. Clearly no integer can square to give 11, so $\sqrt{11}$ must be irrational.) In particular then, $\alpha \notin \mathbb{Q}$, which means that $q(x)$ cannot be factored over $\mathbb{Q}$, as such a factorisation would be of the form $q(x) = (x - \alpha)(x - \beta)$ for some $\beta \in \mathbb{Q}$. Hence $m(x) = q(x)$ is the minimum polynomial for $\alpha$ over $\mathbb{Q}$.

As in (b) we set $\beta = (i\sqrt3 - 1)/2$ and work with $2\beta$. We compute that $(2\beta)^2 = -2 - 2i\sqrt3$, so that

$$(2\beta)^3 = (i\sqrt3 - 1)(-2 - 2i\sqrt3) = -2i\sqrt3 + 2 - 2(-1)(3) + 2i\sqrt3 = 8.$$

But that means that $8\beta^3 = 8$, i.e. $\beta^3 = 1$. Hence $\beta$ is a root of the polynomial $p(x) = x^3 - 1$. However, this isn't the minimum polynomial, because $p(x)$ isn't irreducible. Using cyclotomic polynomials (or just by observation), we see that $p(x)$ factors as

$$p(x) = \Phi_1\Phi_3 = (x - 1)(x^2 + x + 1).$$

The quadratic factor is irreducible, since it's a cyclotomic polynomial. And clearly $\beta$ is a root of this. Therefore the minimum polynomial for $\beta$ over $\mathbb{Q}$ is $m(x) = x^2 + x + 1$.

(81) 

1. Let $L = \mathbb{Q}(\sqrt2, \sqrt[3]{2})$, $E = \mathbb{Q}(\sqrt2)$ and $F = \mathbb{Q}$. Then since the minimum polynomial for $\sqrt2$ over $\mathbb{Q}$ is $x^2 - 2$, we see that $[E : \mathbb{Q}] = 2$. Moreover a basis for $E$ over $\mathbb{Q}$ is $\{1, \sqrt2\}$.

Now consider the extension $E \subset L$. The polynomial $g = x^3 - 2$ is monic and irreducible over $E$, since its roots are $\sqrt[3]{2}$, $\sqrt[3]{2}\,\omega$ and $\sqrt[3]{2}\,\omega^2$, none of which lies in $E$ (here $\omega$ is a primitive 3rd root of unity). So $g$ is the minimum polynomial for $\sqrt[3]{2}$ over $E$ and hence $L = E(\sqrt[3]{2})$ satisfies $[L : E] = \deg(g(x)) = 3$. Moreover a basis for $L$ over $E$ is given by $\{1, \sqrt[3]{2}, (\sqrt[3]{2})^2\}$.

Therefore by the Tower Law, $[L : \mathbb{Q}] = 6$ and a basis for $L$ over $\mathbb{Q}$ is given by

$$\{1, \sqrt2, \sqrt[3]{2}, \sqrt2\sqrt[3]{2}, (\sqrt[3]{2})^2, \sqrt2(\sqrt[3]{2})^2\}.$$

2. Set $\alpha = \sqrt[4]{2} \in \mathbb{R}$ and let $F = \mathbb{Q}(i)$ and $L = \mathbb{Q}(\alpha, i)$, so that $L = F(\alpha)$. We need to find the minimum polynomial of $\alpha$ over $F$.

Now $\alpha$ clearly satisfies the polynomial $g(x) = x^4 - 2$ over $F$. Moreover, the roots of $g(x)$ in $\mathbb{C}$ are $\pm\alpha$ and $\pm\alpha i$, and none of these lies in $F$ because $\alpha \notin \mathbb{Q}$.

Hence $g(x)$ is irreducible over $F$ and so is the minimum polynomial for $\alpha$ over $F$. Therefore $[L : F] = 4$ and a basis for $L$ over $F$ is given by

$$\{1, \alpha, \alpha^2 = \sqrt2, \alpha^3\}.$$

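3. Let $L = \mathbb{Q}(\xi)$ and consider the extension $L/\mathbb{Q}$.

Since $\xi$ is a primitive complex 7th root of unity, its minimum polynomial over $\mathbb{Q}$ is $\Phi_7(x) = x^6 + x^5 + x^4 + x^3 + x^2 + x + 1$. Therefore $[L : \mathbb{Q}] = 6$ and a basis for $L$ over $\mathbb{Q}$ is given by $\{1, \xi, \xi^2, \xi^3, \xi^4, \xi^5\}$.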

4. Let $\omega$ be a primitive complex 3rd root of unity and consider the tower of fields

$$\mathbb{Q} \subset \mathbb{Q}(\sqrt3) \subset \mathbb{Q}(\sqrt3, i) \subset \mathbb{Q}(\sqrt3, i, \omega).$$

The minimum polynomial of $\sqrt3$ over $\mathbb{Q}$ is clearly $x^2 - 3$, as this is monic, irreducible (Eisenstein) and has $\sqrt3$ as a root. Thus $\{1, \sqrt3\}$ is a basis for $\mathbb{Q}(\sqrt3)$ over $\mathbb{Q}$. Similarly, $\{1, i\}$ is a basis for $\mathbb{Q}(\sqrt3, i)$ over $\mathbb{Q}(\sqrt3)$ using the minimum polynomial $x^2 + 1$.

This gives a basis $\{1, \sqrt3, i, \sqrt3\,i\}$ for $\mathbb{Q}(\sqrt3, i)$ over $\mathbb{Q}$.

An argument similar to part (i) shows that [E : Q] = 4 and that a basis for F over Q is given 

The primitive complex 3rd roots of unity are given by $\alpha = (-1 + i\sqrt3)/2$ and $\beta = (-1 - i\sqrt3)/2$, so that $\omega$ must be one of these. But $\alpha$ and $\beta$ both lie in $\mathbb{Q}(\sqrt3, i)$! Hence $\mathbb{Q}(\sqrt3, i) = \mathbb{Q}(\sqrt3, i, \omega)$.
(82) Notice that $\alpha^4 = -1$, so that $\alpha$ is a root of the polynomial $g(x) = x^4 + 1$. We use the result of problem 13 on sheet 1. We saw there that $g(x)$ is irreducible over $\mathbb{Q}$, so that $g(x)$ is the minimum polynomial for $\alpha$ over $\mathbb{Q}$. Hence by Lemma 3.4, $[\mathbb{Q}(\alpha) : \mathbb{Q}] = \deg(g(x)) = 4$.

On the other hand, if $F = \mathbb{R}$, the polynomial $g(x)$ splits into a product of two monic, irreducible quadratics. One of these has $\alpha$ as a root, and hence is the minimum polynomial for $\alpha$ over $\mathbb{R}$. Again by Lemma 3.4, $[\mathbb{R}(\alpha) : \mathbb{R}] = 2$.



(89) 

1. If we could construct 40° we could then bisect it to construct 20°. But $20° = \pi/9$ is not constructible as $\pi/3$ cannot be trisected.

2. If 72° and 8° are constructible, then so is 80°, which can be bisected twice to give 20° again. Thus 8° is not constructible if 72° is. On the other hand, 120° is definitely constructible (just construct an equilateral triangle), hence so is 120° − 72° = 48°. This can then be bisected to give 24°. Thus 24° is constructible from 72°.



(91) If we could perform the required task then we could construct an $x$ satisfying

$$\frac{x^3(15 + 7\sqrt5)}{4} = 5.$$

Rearranging,

$$x^3 = \frac{20}{15 + 7\sqrt5} = 7\sqrt5 - 15$$

(multiplying top and bottom line by $15 - 7\sqrt5$). Thus

$$(x^3 + 15)^2 = 7^2 \times 5 \;\Rightarrow\; x^6 + 30x^3 - 20 = 0.$$

This last is irreducible by Eisenstein (using $p = 5$) and so is the minimum polynomial over $\mathbb{Q}$ of the side length $x$ of the 5-fold volume dodecahedron. But this is a contradiction, since this number cannot be constructed, as the degree $[\mathbb{Q}(x) : \mathbb{Q}] = 6$ is not a power of 2.
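The two numerical facts used here, that Eisenstein's criterion applies at $p = 5$ and that 6 is not a power of 2, are easy to test. This is a minimal sketch; `eisenstein` and `is_power_of_two` are hypothetical helper names, and coefficients are listed from the highest degree down.

```python
def eisenstein(coeffs, p):
    """Check Eisenstein's criterion at the prime p (coefficients highest degree first)."""
    lead, *rest = coeffs
    const = coeffs[-1]
    return (
        lead % p != 0                       # p does not divide the leading coefficient
        and all(c % p == 0 for c in rest)   # p divides every other coefficient
        and const % (p * p) != 0            # p^2 does not divide the constant term
    )

def is_power_of_two(n):
    """True when n is 1, 2, 4, 8, ..."""
    return n > 0 and (n & (n - 1)) == 0

sextic = [1, 0, 0, 30, 0, 0, -20]           # x^6 + 30x^3 - 20
applies_at_5 = eisenstein(sextic, 5)
degree_is_power_of_two = is_power_of_two(len(sextic) - 1)
```

Note that the criterion failing at some other prime says nothing either way; it only needs to hold for one prime.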



(119) 

1. We know (Theorem 17) that if $L$ is the splitting field of some polynomial then $|\mathrm{Gal}(L/\mathbb{Q})| = [L : \mathbb{Q}]$, the degree of the extension $\mathbb{Q} \subset L$. Now, $\mathbb{Q}(\sqrt2)$ is the splitting field of the polynomial $x^2 - 2$, so we have

$$|\mathrm{Gal}(\mathbb{Q}(\sqrt2)/\mathbb{Q})| = [\mathbb{Q}(\sqrt2) : \mathbb{Q}].$$

Since the right hand side is a simple extension, Theorem 15 gives that $[\mathbb{Q}(\sqrt2) : \mathbb{Q}]$ is equal to the degree of the minimum polynomial over $\mathbb{Q}$ of $\sqrt2$. This is obviously $x^2 - 2$ (it's monic, irreducible, in $\mathbb{Q}[x]$ and has $\sqrt2$ as a root), so

$$|\mathrm{Gal}(\mathbb{Q}(\sqrt2)/\mathbb{Q})| = [\mathbb{Q}(\sqrt2) : \mathbb{Q}] = \deg(x^2 - 2) = 2,$$

as required.

Parts (c) and (d) are exactly the same: in (c) we have the splitting field of $1 + x + x^2$ (since the element adjoined to $\mathbb{Q}$ is a 3-rd root of unity) while in (d) we have the splitting field of $x^3 - 2$ of the first lecture. In this last case though we also need to use the tower law (Theorem 16),

$$|\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2}, -\tfrac12 + \tfrac{\sqrt3}{2}i)/\mathbb{Q})| = [\mathbb{Q}(\sqrt[3]{2}, -\tfrac12 + \tfrac{\sqrt3}{2}i) : \mathbb{Q}] = [\mathbb{Q}(\sqrt[3]{2}, -\tfrac12 + \tfrac{\sqrt3}{2}i) : \mathbb{Q}(-\tfrac12 + \tfrac{\sqrt3}{2}i)]\,[\mathbb{Q}(-\tfrac12 + \tfrac{\sqrt3}{2}i) : \mathbb{Q}].$$

Work out each of these in turn using Theorem 15.

2. Here Theorem 17 is of no use as $\mathbb{Q}(\sqrt[3]{2})$ is not the splitting field of anything! (Can you see why?) But, it is easy to do anyway. Any automorphism in $\mathrm{Gal}(\mathbb{Q}(\sqrt[3]{2})/\mathbb{Q})$ must permute the roots of any polynomial that $\sqrt[3]{2}$ is a root of, hence must permute the roots of $x^3 - 2$. But the automorphism must also send $\mathbb{Q}(\sqrt[3]{2}) \subset \mathbb{R}$ into itself, and since the other two roots of $x^3 - 2$ are complex, must in fact send $\sqrt[3]{2}$ to itself. It must then send $(\sqrt[3]{2})^2$ to itself as well, and these two, with 1, form a basis for $\mathbb{Q}(\sqrt[3]{2})$, so our automorphism must be the identity.



(120) 

1. Trick question: the root of $x - 2$ is $2 \in \mathbb{Q}$, so the splitting field is just $\mathbb{Q}$. Since every element of $\mathrm{Gal}(L/\mathbb{Q})$ must fix all the elements of $\mathbb{Q}$ pointwise, we get $|\mathrm{Gal}(L/\mathbb{Q})| = 1$.

2. Another trick question: this is just question 3(d) as $L = \mathbb{Q}(\sqrt[3]{2}, -\tfrac12 + \tfrac{\sqrt3}{2}i)$.

3. The polynomial has splitting field $\mathbb{Q}(\alpha, \omega)$ where $\alpha = \sqrt[5]{2}$ and

$$\omega = \cos\frac{2\pi}{5} + i\sin\frac{2\pi}{5}.$$

By Theorem 17 and the tower law we get

$$|\mathrm{Gal}(L/\mathbb{Q})| = [\mathbb{Q}(\alpha, \omega) : \mathbb{Q}] = [\mathbb{Q}(\alpha, \omega) : \mathbb{Q}(\omega)]\,[\mathbb{Q}(\omega) : \mathbb{Q}].$$

Each of the terms on the right hand side is the degree of a simple extension, so we use Theorem 15: $[\mathbb{Q}(\omega) : \mathbb{Q}] = 4$ since $\omega$ has minimum polynomial $1 + x + x^2 + x^3 + x^4$ (not $x^5 - 1$!). Slightly trickier is the fact that $[\mathbb{Q}(\alpha, \omega) : \mathbb{Q}(\omega)] = 5$. In fact the minimum polynomial over $\mathbb{Q}(\omega)$ of $\alpha$ is indeed $x^5 - 2$, for which it is sufficient to show that this polynomial is irreducible over $\mathbb{Q}(\omega)$.

To do this, we first need that no root of $x^5 - 2$ is in $\mathbb{Q}(\omega)$. These roots are $\alpha, \alpha\omega, \ldots, \alpha\omega^4$. If $\alpha\omega^i \in \mathbb{Q}(\omega)$, then $\alpha\omega^i\omega^{-i}$ is too, i.e. $\alpha$ is. We can probably believe that this is not the case (see me for a more rigorous statement!). Since the polynomial has degree 5, checking the roots is not enough: it could factorise into non-linear factors, but these must be a quadratic and a cubic. In fact, the quadratic must be of the form

$$(x - \alpha\omega^i)(x - \alpha\omega^j) = x^2 - (\alpha\omega^i + \alpha\omega^j)x + \alpha^2\omega^{i+j}.$$

This must be a polynomial over $\mathbb{Q}(\omega)$, so in particular $\alpha^2\omega^{i+j} \in \mathbb{Q}(\omega)$. But then similarly, $\alpha^2 \in \mathbb{Q}(\omega) \Rightarrow \alpha^6 = 2\alpha \in \mathbb{Q}(\omega)$. We have already "convinced ourselves" that this isn't so.

4. By Theorem 17 again,

$$|\mathrm{Gal}(L/\mathbb{Q})| = [L : \mathbb{Q}],$$

where $L = \mathbb{Q}(\omega)$ with $\omega$ a primitive 5-th root of unity. The polynomial given is the minimum polynomial of $\omega$ over $\mathbb{Q}$, so we have

$$|\mathrm{Gal}(L/\mathbb{Q})| = \deg(1 + x + x^2 + x^3 + x^4) = 4.$$

See also question 9.

5. By the hint, the roots of $1 + x^2 + x^4$ are the roots of $x^6 - 1$ that are not $\pm1$, hence are $\omega$, $\omega^2$, $\omega^4$ and $\omega^5$, with $\omega$ a primitive 6-th root of unity. Consider $F = \mathbb{Q}(\omega)$. Then clearly these roots are in $F$, so that $F$ contains the splitting field. On the other hand, the splitting field contains $\mathbb{Q}$ (since any subfield of $\mathbb{C}$ does) and must contain the root $\omega$. Thus $F$ is contained in the splitting field, i.e. it is the splitting field.

Thus the order of the Galois group is equal to the degree $[F : \mathbb{Q}] = [\mathbb{Q}(\omega) : \mathbb{Q}]$, which in turn is equal to the degree of the minimum polynomial (over $\mathbb{Q}$) of $\omega$. One may be tempted to guess $1 + x^2 + x^4$ for this, but

$$1 + x^2 + x^4 = (x^2 + x + 1)(x^2 - x + 1),$$

so it is not irreducible. Your next guess, the quadratic factor $x^2 - x + 1$, would be correct, as its roots, $\omega$ and $\omega^5$, are not in $\mathbb{Q}$. Thus the order of the Galois group is 2.

(121) 

1. $\mathbb{Q}(\cos\frac{2\pi}{p} + i\sin\frac{2\pi}{p})$ is the splitting field of the $p$-th cyclotomic polynomial $\Phi_p(x) = 1 + x + x^2 + \cdots + x^{p-1}$, since

$$(x - 1)\Phi_p(x) = x^p - 1.$$

Thus the order of the group is equal to the degree of the extension which, being simple, can be deduced from Theorem 15 and the fact that $\Phi_p$ is the minimum polynomial over $\mathbb{Q}$ of the element being adjoined (where we have used assignment 3, number 1 again to get that $\Phi_p$ is irreducible). We thus get that the Galois group has order $p - 1$ as claimed.

2. This is entirely analogous to question 4(c), except $\alpha = \sqrt[p]{2}$ and $\omega$ is a primitive $p$-th root of 1.
(128) 

1. The Galois group of L over k is the group of all automorphisms of the field L that leave the subfield k fixed pointwise. 

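2. Observe first that a basis for $\mathbb{Q}(\alpha, i)$ over $\mathbb{Q}$ is given by

$$\{1, \alpha, \alpha^2, \alpha^3, i, \alpha i, \alpha^2 i, \alpha^3 i\}$$

and the effect of the eight automorphisms on $\alpha$ and $i$ is given by

$$
\begin{array}{c|cccccccc}
 & 1 & \sigma & \sigma^2 & \sigma^3 & \tau & \sigma\tau & \sigma^2\tau & \sigma^3\tau \\ \hline
\alpha & \alpha & \alpha i & -\alpha & -\alpha i & \alpha & \alpha i & -\alpha & -\alpha i \\
i & i & i & i & i & -i & -i & -i & -i
\end{array}
$$

Hence the automorphisms are distinct and by their effect on $\alpha$ and $i$, we see that $\sigma\tau = \tau\sigma^3$.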

3. $H_1 = \{1, \sigma^2, \sigma\tau, \sigma^3\tau\}$, $H_2 = \{1, \sigma^2\}$ and $H_3 = \{1, \sigma\tau\}$. By the Galois correspondence, we have $[\mathbb{Q}(\alpha, i) : \mathbb{Q}]$ is equal to the index of $\{1\}$ in $G$, i.e. the order of $G$, which is 8. The tower law gives $[F_1 : \mathbb{Q}] = [F_1 : \mathbb{Q}(i\sqrt2)]\,[\mathbb{Q}(i\sqrt2) : \mathbb{Q}]$ with $[F_1 : \mathbb{Q}] = 4$ and $[\mathbb{Q}(i\sqrt2) : \mathbb{Q}] = 2$ since the corresponding Galois groups have these indices in $G$. Hence $[F_1 : \mathbb{Q}(i\sqrt2)] = 2$.

To describe the fields we use the fact that any element $x$ of $\mathbb{Q}(\alpha, i)$ can be written uniquely in the form

$$x = a_0 + a_1\alpha + a_2\alpha^2 + a_3\alpha^3 + a_4 i + a_5\alpha i + a_6\alpha^2 i + a_7\alpha^3 i,$$

for some $a_i \in \mathbb{Q}$. The Galois group of $\mathbb{Q}(\alpha, i)$ over $F_1$ is $\{1, \sigma^2\}$ from the lattice diagram and the Galois correspondence. Hence every element of $F_1$ is fixed by $\sigma^2$, where

$$\sigma^2(x) = a_0 - a_1\alpha + a_2\alpha^2 - a_3\alpha^3 + a_4 i - a_5\alpha i + a_6\alpha^2 i - a_7\alpha^3 i,$$

and equating this with the previous expression we see that such an $x$ must satisfy $a_{2k} = a_{2k}$ and $a_{2k+1} = -a_{2k+1}$, so that $a_1 = a_3 = a_5 = a_7 = 0$ while $a_0$, $a_2$, $a_4$ and $a_6$ are arbitrary. Hence we have

$$x = a_0 + a_2\alpha^2 + a_4 i + a_6\alpha^2 i,$$

and clearly $F_1 \subset \mathbb{Q}(\alpha^2, i)$. On the other hand, $\alpha^2, i \in F_1$ and so $\mathbb{Q}(\alpha^2, i) \subset F_1$. Thus $F_1 = \mathbb{Q}(\alpha^2, i)$.

To get the other one, notice that $\tau$ does not fix $i\sqrt2$, hence the Galois group of $\mathbb{Q}(\alpha, i)$ over $\mathbb{Q}(i\sqrt2)$ must be $H_1$, so that the Galois group of $\mathbb{Q}(\alpha, i)$ over $F_2$ is $\{1, \sigma^2, \tau, \sigma^2\tau\}$. Running through the calculation above gives

$$\sigma^2(x) = a_0 - a_1\alpha + a_2\alpha^2 - a_3\alpha^3 + a_4 i - a_5\alpha i + a_6\alpha^2 i - a_7\alpha^3 i,$$

and

$$\tau(x) = a_0 + a_1\alpha + a_2\alpha^2 + a_3\alpha^3 - a_4 i - a_5\alpha i - a_6\alpha^2 i - a_7\alpha^3 i,$$

giving $a_1 = a_3 = a_5 = a_7 = a_4 = a_6 = 0$, hence an $x \in F_2$ is of the form

$$x = a_0 + a_2\alpha^2,$$

and $F_2 = \mathbb{Q}(\sqrt2)$ for the same reasons as before.

(129) 

1 . The Galois group of L over F is the group of all automorphisms of the field L that leave the subfield F fixed pointwise. 

2. The polynomial in question has roots $\omega, \omega^2, \omega^3, \omega^4, \omega^5$ and $\omega^6$, and these are clearly all in $\mathbb{Q}(\omega)$. On the other hand, if $F$ is any field containing the roots of the polynomial, then $F$ certainly contains $\omega$ (as this is one of them) and it also contains $\mathbb{Q}$ (assignment question done in the problems class). But $\omega \in F$ and $\mathbb{Q} \subset F$ means $\mathbb{Q}(\omega) \subset F$, so that it is indeed the smallest field containing the roots. The order of the Galois group now follows immediately since

$$|\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})| = [\mathbb{Q}(\omega) : \mathbb{Q}] = \deg(1 + x + x^2 + x^3 + x^4 + x^5 + x^6) = 6,$$

since the polynomial is the minimum polynomial for $\omega$ over $\mathbb{Q}$.

3. A basis for $\mathbb{Q}(\omega)$ is given by

$$\{1, \omega, \omega^2, \omega^3, \omega^4, \omega^5\},$$

and so any automorphism is determined by its effect on these basis vectors. In fact, any automorphism is determined by its effect on $\omega$ alone. On the other hand, any automorphism must permute the roots of any polynomial that $\omega$ is a root of, e.g. $1 + x + x^2 + x^3 + x^4 + x^5 + x^6$ with roots $\omega, \omega^2, \omega^3, \omega^4, \omega^5$ and $\omega^6$. Combining all this with the fact that there are exactly 6 automorphisms means that they are precisely the maps that send $\omega$ to one of these 6 roots. If $\sigma$ sends $\omega$ to $\omega^3$ as stated, then $\sigma^2(\omega) = (\omega^3)^3 = \omega^9 = \omega^2$, and $\sigma^3(\omega) = \omega^{27} = \omega^6$. Neither $\sigma^2$ nor $\sigma^3$ is the identity, so by Lagrange $\sigma$ has order 6 in the Galois group, which must then be cyclic as claimed.

4. Clearly any subgroup containing $\sigma$ contains everything. Similarly, since the powers of $\sigma^5$ yield all the elements of the group, any subgroup containing $\sigma^5$ is the whole group. Thus, for a proper subgroup we must not include $\sigma$ or $\sigma^5$. If we don't include $\sigma^2$ we get the subgroup $\{1\}$ or $\{1, \sigma^3\}$, whereas if we do, we get the subgroup $\{1, \sigma^2, \sigma^4\}$.

By the Galois correspondence, we get the lattice of intermediate subfields as claimed, with $F_1$ the fixed field of a subgroup of index 2, hence of $\{1, \sigma^2, \sigma^4\}$, and $F_2$ the fixed field of $\{1, \sigma^3\}$.
Now any element of $\mathbb{Q}(\omega)$ can be written as

$$x = a_0 + a_1\omega + a_2\omega^2 + a_3\omega^3 + a_4\omega^4 + a_5\omega^5$$

with the $a_i \in \mathbb{Q}$. We require $x$ such that $\sigma^3(x) = x$, where

$$\sigma^3(a_0 + a_1\omega + a_2\omega^2 + a_3\omega^3 + a_4\omega^4 + a_5\omega^5) = a_0 + a_1(-1 - \omega - \cdots - \omega^5) + a_2\omega^5 + a_3\omega^4 + a_4\omega^3 + a_5\omega^2$$

and we have by equating coefficients that $a_0 - a_1 = a_0$, $a_1 = -a_1$, $a_5 - a_1 = a_2$, $a_4 - a_1 = a_3$, $a_3 - a_1 = a_4$, and $a_2 - a_1 = a_5$. Thus $x$ must have the form

$$x = a_0 + a_2(\omega^2 + \omega^5) + a_3(\omega^3 + \omega^4).$$

Hence the fixed field is $\subset \mathbb{Q}(\omega^2 + \omega^5, \omega^3 + \omega^4)$. On the other hand, $\sigma^3$ fixes both these elements and thus $\mathbb{Q}(\omega^2 + \omega^5, \omega^3 + \omega^4) \subset$ the fixed field, giving $F_2 = \mathbb{Q}(\omega^2 + \omega^5, \omega^3 + \omega^4)$.
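Since each automorphism is determined by the exponent $k$ with $\omega \mapsto \omega^k$, the order of $\sigma$ and the elements paired up by $\sigma^3$ can be read off from arithmetic modulo 7. The following is a sketch under that identification; `orbit` is a hypothetical helper name.

```python
def orbit(exponent, n):
    """List k_1, k_2, ... with sigma^j(omega) = omega^(k_j), where sigma: omega -> omega^exponent and omega^n = 1."""
    seen, k = [], 1
    while True:
        k = (k * exponent) % n
        seen.append(k)
        if k == 1:              # sigma^j is the identity once the exponent returns to 1
            return seen

exponents = orbit(3, 7)         # sigma: omega -> omega^3 with omega^7 = 1
order_of_sigma = len(exponents)

# sigma^3 sends omega to omega^(3^3 mod 7); record how it moves each power of omega.
cube = pow(3, 3, 7)
images = {k: (k * cube) % 7 for k in range(1, 7)}
```

Here `images` records that $\sigma^3$ swaps $\omega^2 \leftrightarrow \omega^5$ and $\omega^3 \leftrightarrow \omega^4$, which is why the sums $\omega^2 + \omega^5$ and $\omega^3 + \omega^4$ are fixed. The same helper with `orbit(2, 5)` covers the map $\omega \mapsto \omega^2$ on 5-th roots of unity.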

(131) 

1. The polynomial in question has roots $\omega, \omega^2, \omega^3$ and $\omega^4$, and these are clearly all in $\mathbb{Q}(\omega)$. On the other hand, if $F$ is any field containing the roots of the polynomial, then $F$ certainly contains $\omega$ (as this is one of them) and it also contains $\mathbb{Q}$ (see question 3, assignment 1). But $\omega \in F$ and $\mathbb{Q} \subset F$ means $\mathbb{Q}(\omega) \subset F$, so that it is indeed the smallest field containing the roots.

2. This follows immediately since

$$|\mathrm{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})| = [\mathbb{Q}(\omega) : \mathbb{Q}] = \deg(1 + x + x^2 + x^3 + x^4) = 4,$$

since the polynomial is the minimum polynomial for $\omega$ over $\mathbb{Q}$.

3. A basis for $\mathbb{Q}(\omega)$ is given by

$$\{1, \omega, \omega^2, \omega^3\},$$

and so any automorphism is determined by its effect on these basis vectors. In fact, any automorphism is determined by its effect on $\omega$ alone. On the other hand, any automorphism must permute the roots of any polynomial that $\omega$ is a root of, e.g. $1 + x + x^2 + x^3 + x^4$ with roots $\omega, \omega^2, \omega^3$ and $\omega^4$. Combining all this with the fact that there are exactly 4 automorphisms means that they are precisely the maps that send $\omega$ to $\omega$ or $\omega^2$ or $\omega^3$ or $\omega^4$. If $\sigma$ sends $\omega$ to $\omega^2$ as stated, then $\sigma^2$ sends $\omega$ to $\omega^4$, $\sigma^3$ sends $\omega$ to $\omega^8 = \omega^3$, and $\sigma^4$ sends $\omega$ to $\omega^{16} = \omega$, so that $\sigma^4 = 1$.
Thus, the Galois group has the elements as stated.
4. Clearly any subgroup containing $\sigma$ contains everything. Similarly, since the powers of $\sigma^3$ yield all the elements of the group, any subgroup containing $\sigma^3$ is the whole group. Thus, for a proper subgroup we must not include $\sigma$ or $\sigma^3$. If we don't include $\sigma^2$ we get the subgroup $\{1\}$, whereas if we do, we get the subgroup $\{1, \sigma^2\}$. Thus the lattice has only one subgroup apart from the two obvious ones.

5. This follows immediately by the Galois correspondence and since the subgroup lattice has only one subgroup apart from $\{1\}$ and $G$. Now any element of $\mathbb{Q}(\omega)$ can be written as

$$x = a_0 + a_1\omega + a_2\omega^2 + a_3\omega^3,$$

with the $a_i \in \mathbb{Q}$, and the intermediate field we are after is the fixed field of the subgroup $\{1, \sigma^2\}$. That is, we seek those $x \in \mathbb{Q}(\omega)$ with $\sigma^2(x) = x$. On the other hand,

$$\sigma^2(a_0 + a_1\omega + a_2\omega^2 + a_3\omega^3) = a_0 + a_1\omega^4 + a_2\omega^3 + a_3\omega^2 = a_0 + a_1(-1 - \omega - \omega^2 - \omega^3) + a_2\omega^3 + a_3\omega^2$$

so that

$$\sigma^2(x) = (a_0 - a_1) - a_1\omega + (a_3 - a_1)\omega^2 + (a_2 - a_1)\omega^3,$$

and we have by equating coefficients that $a_0 = a_0 - a_1$, $a_1 = -a_1$, $a_2 = a_3 - a_1$ and $a_3 = a_2 - a_1$. Thus $x$ must have the form

$$x = a + b\omega^2 + b\omega^3 = a + b(\omega^2 + \omega^3) \in \mathbb{Q}(\omega^2 + \omega^3).$$

Hence the fixed field is $\subset \mathbb{Q}(\omega^2 + \omega^3)$. On the other hand, $\sigma^2(\omega^2 + \omega^3) = \omega^2 + \omega^3$, and so $\sigma^2$ must fix $\mathbb{Q}(\omega^2 + \omega^3)$ pointwise. Thus $\mathbb{Q}(\omega^2 + \omega^3) \subset$ the fixed field, and $\mathbb{Q}(\omega^2 + \omega^3)$ is the field we seek.